
Naïve Bayes Classifier

Ke Chen

COMP24111 Machine Learning


Outline
• Background
• Probability Basics
• Probabilistic Classification
• Naïve Bayes
• Example: Play Tennis
• Relevant Issues
• Conclusions

Background
• There are three methods to establish a classifier
a) Model a classification rule directly
Examples: k-NN, decision trees, perceptron, SVM
b) Model the probability of class memberships given input data
Example: perceptron with the cross-entropy cost
c) Make a probabilistic model of data within each class
Examples: naïve Bayes, model-based classifiers
• a) and b) are examples of discriminative classification
• c) is an example of generative classification
• b) and c) are both examples of probabilistic classification

Probability Basics
• Prior, conditional and joint probability for random variables
– Prior probability: P(X)
– Conditional probability: P(X1|X2), P(X2|X1)
– Joint probability: X = (X1, X2), P(X) = P(X1, X2)
– Relationship: P(X1, X2) = P(X2|X1) P(X1) = P(X1|X2) P(X2)
– Independence: P(X2|X1) = P(X2), P(X1|X2) = P(X1), P(X1, X2) = P(X1) P(X2)
• Bayesian Rule

    P(C|X) = P(X|C) P(C) / P(X)

  i.e.  Posterior = (Likelihood × Prior) / Evidence

Probability Basics
• Quiz: We have two six-sided dice. When they are rolled, the following events can occur: (A) die 1 lands on side "3", (B) die 2 lands on side "1", and (C) the two dice sum to eight. Answer the following questions:
1) P(A) = ?
2) P(B) = ?
3) P(C) = ?
4) P(A|B) = ?
5) P(C|A) = ?
6) P(A, B) = ?
7) P(A, C) = ?
8) Is P(A, C) equal to P(A) × P(C)?
(A brute-force check of these quantities is sketched below.)
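Not part of the original slides: a minimal Python sketch, assuming two fair six-sided dice, that checks the quiz quantities by enumerating all 36 equally likely outcomes.

from fractions import Fraction
from itertools import product

outcomes = list(product(range(1, 7), repeat=2))   # all 36 (die1, die2) pairs

def prob(event):
    """Probability of an event as a fraction of the 36 equally likely outcomes."""
    hits = [o for o in outcomes if event(o)]
    return Fraction(len(hits), len(outcomes))

def cond_prob(event, given):
    """Conditional probability P(event | given), counting within the 'given' outcomes."""
    given_hits = [o for o in outcomes if given(o)]
    joint_hits = [o for o in given_hits if event(o)]
    return Fraction(len(joint_hits), len(given_hits))

A = lambda o: o[0] == 3            # die 1 lands on "3"
B = lambda o: o[1] == 1            # die 2 lands on "1"
C = lambda o: o[0] + o[1] == 8     # the two dice sum to eight

print("P(A)     =", prob(A))                          # 1/6
print("P(B)     =", prob(B))                          # 1/6
print("P(C)     =", prob(C))                          # 5/36
print("P(A|B)   =", cond_prob(A, B))                  # 1/6
print("P(C|A)   =", cond_prob(C, A))                  # 1/6 (die 2 must show 5)
print("P(A,B)   =", prob(lambda o: A(o) and B(o)))    # 1/36
print("P(A,C)   =", prob(lambda o: A(o) and C(o)))    # 1/36
print("P(A)P(C) =", prob(A) * prob(C))                # 5/216, so A and C are not independent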
Probabilistic Classification
• Establishing a probabilistic model for classification
– Discriminative model
    P(C|X),  C = c1, ..., cL,  X = (X1, ..., Xn)
[Diagram: a discriminative probabilistic classifier takes the input x = (x1, x2, ..., xn) and directly outputs the posterior probabilities P(c1|x), P(c2|x), ..., P(cL|x).]
Probabilistic Classification
• Establishing a probabilistic model for classification (cont.)
– Generative model
    P(X|C),  C = c1, ..., cL,  X = (X1, ..., Xn)
[Diagram: one generative probabilistic model per class; the model for class i takes x = (x1, x2, ..., xn) and outputs the likelihood P(x|ci), for i = 1, ..., L.]
Probabilistic Classification
• MAP classification rule
– MAP: Maximum A Posteriori
– Assign x to c* if
    P(C = c*|X = x) > P(C = c|X = x)   for all c ≠ c*, c = c1, ..., cL

• Generative classification with the MAP rule
– Apply the Bayesian rule to convert the likelihoods into posterior probabilities
    P(C = ci|X = x) = P(X = x|C = ci) P(C = ci) / P(X = x)
                    ∝ P(X = x|C = ci) P(C = ci)    for i = 1, 2, ..., L
– Then apply the MAP rule (a minimal sketch follows)
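Not in the original slides: a minimal Python sketch of generative classification with the MAP rule, assuming the class priors P(C = c) and per-class likelihood functions P(x|c) are already available (the names priors and likelihoods are hypothetical).

def map_classify(x, priors, likelihoods):
    """Generative MAP classification.

    priors:      dict mapping class label -> P(C = c)
    likelihoods: dict mapping class label -> a function returning P(X = x | C = c)
    Returns the label c* maximising P(x|c) * P(c); the evidence P(x) is a common
    factor across classes, so it can be ignored when only the argmax is needed.
    """
    scores = {c: likelihoods[c](x) * priors[c] for c in priors}
    return max(scores, key=scores.get)

Naïve Bayes, introduced next, is one particular way of modelling the likelihood P(x|c).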

Naïve Bayes
• Bayes classification
    P(C|X) ∝ P(X|C) P(C) = P(X1, ..., Xn|C) P(C)
  Difficulty: learning the joint probability P(X1, ..., Xn|C)
• Naïve Bayes classification
– Assumption: all input attributes are conditionally independent given the class!
    P(X1, X2, ..., Xn|C) = P(X1|X2, ..., Xn, C) P(X2, ..., Xn|C)
                         = P(X1|C) P(X2, ..., Xn|C)
                         = P(X1|C) P(X2|C) ... P(Xn|C)
– MAP classification rule: for x = (x1, x2, ..., xn), assign x to c* if
    [P(x1|c*) ... P(xn|c*)] P(c*) > [P(x1|c) ... P(xn|c)] P(c),   c ≠ c*, c = c1, ..., cL

Naïve Bayes
• Naïve Bayes Algorithm (for discrete input attributes)
– Learning Phase: Given a training set S,
    For each target value ci (ci = c1, ..., cL):
      P̂(C = ci) ← estimate P(C = ci) with examples in S
    For every attribute value x_jk of each attribute X_j (j = 1, ..., n; k = 1, ..., N_j):
      P̂(X_j = x_jk|C = ci) ← estimate P(X_j = x_jk|C = ci) with examples in S
    Output: conditional probability tables; for each X_j, a table of N_j × L elements
– Test Phase: Given an unknown instance x' = (a1, ..., an),
    look up the tables and assign the label c* to x' if
    [P̂(a1|c*) ... P̂(an|c*)] P̂(c*) > [P̂(a1|c) ... P̂(an|c)] P̂(c),   c ≠ c*, c = c1, ..., cL
  (A toy implementation is sketched below.)
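Not part of the original slides: a minimal Python sketch of the two phases above for discrete attributes, using raw relative frequencies as the estimates (no smoothing); the helper names are hypothetical.

from collections import Counter, defaultdict

def learn_naive_bayes(examples):
    """Learning phase: examples is a list of (attribute_dict, class_label) pairs.
    Returns the estimated class priors and conditional probability tables."""
    class_counts = Counter(label for _, label in examples)
    priors = {c: n / len(examples) for c, n in class_counts.items()}

    # cond[class][attribute][value] = P^(X_j = value | C = class)
    value_counts = defaultdict(lambda: defaultdict(Counter))
    for attrs, label in examples:
        for attr, value in attrs.items():
            value_counts[label][attr][value] += 1
    cond = {c: {a: {v: n / class_counts[c] for v, n in vals.items()}
                for a, vals in attr_tables.items()}
            for c, attr_tables in value_counts.items()}
    return priors, cond

def classify(instance, priors, cond):
    """Test phase: pick the class maximising P^(a1|c) ... P^(an|c) P^(c)."""
    def score(c):
        s = priors[c]
        for attr, value in instance.items():
            s *= cond[c][attr].get(value, 0.0)   # unseen value -> 0 (see the smoothing remedy later)
        return s
    return max(priors, key=score)

Applied to the Play Tennis training set of the next slides, the learning phase reproduces the probability tables shown there.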

Example
• Example: Play Tennis
[Table in the original slides: 14 training days described by Outlook, Temperature, Humidity and Wind, each labelled Play=Yes (9 days) or Play=No (5 days); the counts are summarised in the Learning Phase tables below.]
Example
• Learning Phase

  Outlook      Play=Yes  Play=No
  Sunny          2/9       3/5
  Overcast       4/9       0/5
  Rain           3/9       2/5

  Temperature  Play=Yes  Play=No
  Hot            2/9       2/5
  Mild           4/9       2/5
  Cool           3/9       1/5

  Humidity     Play=Yes  Play=No
  High           3/9       4/5
  Normal         6/9       1/5

  Wind         Play=Yes  Play=No
  Strong         3/9       3/5
  Weak           6/9       2/5

  P(Play=Yes) = 9/14      P(Play=No) = 5/14

Example
• Test Phase
– Given a new instance
    x' = (Outlook=Sunny, Temperature=Cool, Humidity=High, Wind=Strong)
– Look up the tables
    P(Outlook=Sunny|Play=Yes) = 2/9        P(Outlook=Sunny|Play=No) = 3/5
    P(Temperature=Cool|Play=Yes) = 3/9     P(Temperature=Cool|Play=No) = 1/5
    P(Humidity=High|Play=Yes) = 3/9        P(Humidity=High|Play=No) = 4/5
    P(Wind=Strong|Play=Yes) = 3/9          P(Wind=Strong|Play=No) = 3/5
    P(Play=Yes) = 9/14                     P(Play=No) = 5/14

– MAP rule
    P(Yes|x') ∝ [P(Sunny|Yes) P(Cool|Yes) P(High|Yes) P(Strong|Yes)] P(Play=Yes) ≈ 0.0053
    P(No|x')  ∝ [P(Sunny|No) P(Cool|No) P(High|No) P(Strong|No)] P(Play=No) ≈ 0.0206

Since P(Yes|x') < P(No|x'), we label x' as "No". (The arithmetic is checked in the sketch below.)
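A quick check of the two scores in Python, plugging in the fractions looked up above:

# Unnormalised posteriors for x' = (Sunny, Cool, High, Strong)
score_yes = (2/9) * (3/9) * (3/9) * (3/9) * (9/14)   # ≈ 0.0053
score_no  = (3/5) * (1/5) * (4/5) * (3/5) * (5/14)   # ≈ 0.0206
print(f"P(Yes|x') ~ {score_yes:.4f}, P(No|x') ~ {score_no:.4f}")
print("prediction:", "Yes" if score_yes > score_no else "No")   # -> No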

Relevant Issues
• Violation of the Independence Assumption
– For many real-world tasks, P(X1, ..., Xn|C) ≠ P(X1|C) ... P(Xn|C)
– Nevertheless, naïve Bayes works surprisingly well anyway!
• Zero Conditional Probability Problem
– If no training example contains the attribute value X_j = a_jk, then P̂(X_j = a_jk|C = ci) = 0
– In that case the whole product P̂(x1|ci) ... P̂(a_jk|ci) ... P̂(xn|ci) = 0 at test time
– As a remedy, conditional probabilities are estimated with the m-estimate:
    P̂(X_j = a_jk|C = ci) = (n_c + m·p) / (n + m)
  n_c: number of training examples for which X_j = a_jk and C = ci
  n:   number of training examples for which C = ci
  p:   prior estimate (usually p = 1/t for t possible values of X_j)
  m:   weight given to the prior (number of "virtual" examples, m ≥ 1)
  (A short sketch of this estimate follows.)
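Not part of the original slides: a minimal Python sketch of the smoothed estimate; the parameter names mirror the definitions above.

def m_estimate(n_c, n, t, m=1):
    """Smoothed conditional probability P^(X_j = a_jk | C = c_i) = (n_c + m*p) / (n + m).

    n_c: number of training examples with X_j = a_jk and C = c_i
    n:   number of training examples with C = c_i
    t:   number of possible values of X_j, so the prior estimate is p = 1/t
    m:   weight given to the prior ("virtual" examples)
    """
    p = 1.0 / t
    return (n_c + m * p) / (n + m)

# Example from the Play Tennis tables: Outlook=Overcast never occurs with Play=No
# (0 of 5 examples, 3 possible Outlook values)
print(m_estimate(n_c=0, n=5, t=3))   # ≈ 0.056 instead of 0, so the product no longer collapses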
Relevant Issues
• Continuous-valued Input Attributes
– An attribute can take innumerable values, so probability tables cannot be built
– The conditional probability is instead modelled with the normal (Gaussian) distribution:
    P̂(X_j|C = ci) = 1 / (√(2π) σ_ji) · exp( −(X_j − μ_ji)² / (2 σ_ji²) )
  μ_ji: mean (average) of the values of attribute X_j over examples for which C = ci
  σ_ji: standard deviation of the values of attribute X_j over examples for which C = ci
– Learning Phase: for X = (X1, ..., Xn) and C = c1, ..., cL, estimate μ_ji and σ_ji for every attribute–class pair
  Output: n × L normal distributions and P(C = ci), i = 1, ..., L
– Test Phase: for a new instance x' = (x1, ..., xn)
  • Calculate the conditional probabilities with all the normal distributions
  • Apply the MAP rule to make a decision
  (A minimal Gaussian naïve Bayes sketch follows.)
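Not part of the original slides: a minimal Python sketch of the Gaussian case, assuming numeric feature vectors; μ and σ are estimated per attribute and class, then the MAP rule is applied with the product of Gaussian densities.

import math

def learn_gaussian_nb(X, y):
    """Learning phase: estimate the class priors and, for each class/attribute pair,
    the mean and standard deviation of the attribute values."""
    priors, params = {}, {}
    for c in set(y):
        rows = [x for x, label in zip(X, y) if label == c]
        priors[c] = len(rows) / len(X)
        cols = list(zip(*rows))                         # one tuple of values per attribute
        means = [sum(col) / len(col) for col in cols]
        stds = [math.sqrt(sum((v - mu) ** 2 for v in col) / len(col)) or 1e-9
                for col, mu in zip(cols, means)]        # tiny floor avoids division by zero
        params[c] = (means, stds)
    return priors, params

def gaussian(x, mu, sigma):
    """Normal density with mean mu and standard deviation sigma."""
    return math.exp(-(x - mu) ** 2 / (2 * sigma ** 2)) / (math.sqrt(2 * math.pi) * sigma)

def classify_gaussian(x, priors, params):
    """Test phase: MAP rule with the product of per-attribute Gaussian densities."""
    def score(c):
        means, stds = params[c]
        s = priors[c]
        for v, mu, sigma in zip(x, means, stds):
            s *= gaussian(v, mu, sigma)
        return s
    return max(priors, key=score)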

Conclusions
• Naïve Bayes is based on the conditional independence assumption
– Training is very easy and fast; it only requires considering each attribute in each class separately
– Testing is straightforward; it only requires looking up tables or calculating conditional probabilities with normal distributions
• A popular generative model
– Performance is competitive with most state-of-the-art classifiers, even when the independence assumption is violated
– Many successful applications, e.g., spam mail filtering
– A good candidate for a base learner in ensemble learning
– Apart from classification, naïve Bayes can do more…
