L05 - Advanced Analytical Theory and Methods: Classification
Madava Viranjan
What is Classification?
• Classification is a form of data analysis that extracts models describing important data classes
• The classifier is presented with a set of examples that are already classified, and from these
examples it learns to assign class labels to unseen examples
• Since we use a dataset with class labels to train the classifier, classification falls under
supervised learning
Some Examples
• A bank officer needs to classify a loan application as safe or risky. Categorical prediction:
“safe” or “risky”
• A medical researcher wants to analyze breast cancer data to predict which one of three
specific treatments a patient should receive. Categorical prediction: “treatment A”,
“treatment B”, or “treatment C”
Classification vs Numeric Prediction
• In the previous example, if the marketing manager of the electronics shop wants to know how
much a given customer will spend in one visit, then the task is numeric prediction.
• Regression analysis is one of the most common numeric prediction methodologies.
Steps in Classification
1. Learning step
Construct the classifier by learning from the training dataset
Steps in Classification ctd.
2. Classification step
The model is used for classification.
The accuracy of the prediction needs to be measured. If we use the same training dataset for this,
we will get optimistic measures, because the classifier tends to overfit the training data. Therefore a
separate dataset, called the test dataset, is needed
Decision Trees
Introduction
• Uses a tree structure to specify sequences of
decisions and consequences.
• Nodes
• Tests some attribute
• Root node is the top node
• Internal nodes are the decision or test points
• Leaf nodes are at the end of the last branch,
and they represent the class label
Introduction ctd.
• Branching of a node is called a split
• For each split, the most informative attribute should be used as the splitting attribute
• The most common way of selecting the most informative attribute is an entropy-based method
• In the end this looks like a bunch of if-else statements, so why is it considered data mining or
machine learning? Because the tree, unlike hand-written rules, is learned automatically from the data
• Consider the dataset below and identify how the data points will be arranged in the given
decision tree
[Figure: example decision tree with internal-node tests X0 <= -2, X0 <= 2, and X1 <= 2, and class labels at the leaves]
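The tree in the figure can be read directly as nested if-else tests. The class labels at the leaves below are hypothetical placeholders, since the figure only shows the split conditions.

```python
def classify(x0, x1):
    """Walk the example tree: each internal node tests one attribute, each leaf is a class label."""
    if x0 <= -2:              # root node test
        return "class 1"      # hypothetical leaf label
    if x0 <= 2:               # internal node: tighter bound on the same attribute
        if x1 <= 2:           # internal node: test on the second attribute
            return "class 2"  # hypothetical leaf label
        return "class 3"      # hypothetical leaf label
    return "class 4"          # hypothetical leaf label

print(classify(-3, 0))  # falls into the X0 <= -2 branch
```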
Splitting an Attribute
• At any splitting point, the attribute with the maximum information gain will be selected as
the splitting attribute
• Information gain is computed from entropy: Info(D) = −Σ pᵢ log₂(pᵢ), where
pᵢ = probability that a tuple in D belongs to class i
Splitting an Attribute: Example
• In the previous dataset there are two attributes that can be used to split. Consider the nodes
below, each of which tests one attribute value. Show which attribute should be selected
as the splitting attribute based on information gain.
[Figure: two candidate splits at node A — left: X0 <= -2 with children B and C; right: X1 <= 0.8 with children D and E]
How Does the Algorithm Work?
• The algorithm is called with three parameters
• D – Data partition. Initially, it is the complete set of training tuples
• Attribute_list – list of attributes describing the tuples
• Attribute_selection_method – procedure for selecting the attribute that best discriminates the given tuples
How Does the Algorithm Work?
• The expected information needed to classify a tuple in D is given by
Info(D) = −Σ pᵢ log₂(pᵢ)
• After splitting D on attribute A into v partitions Dⱼ, the information still needed is
Info_A(D) = Σ (|Dⱼ|/|D|) × Info(Dⱼ), and the information gain is Gain(A) = Info(D) − Info_A(D)
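As a check on the formulas, here is a small sketch computing Info(D) and the gain of a candidate split. The class distribution (9 positive, 5 negative tuples) and the three-way partition are illustrative assumptions.

```python
from math import log2

def entropy(counts):
    """Info(D) = -sum(p_i * log2(p_i)) over the class distribution, given raw class counts."""
    total = sum(counts)
    return -sum((c / total) * log2(c / total) for c in counts if c > 0)

def information_gain(parent_counts, child_counts_list):
    """Gain(A) = Info(D) - sum(|D_j|/|D| * Info(D_j)) over the partitions of a split."""
    total = sum(parent_counts)
    info_a = sum(sum(child) / total * entropy(child) for child in child_counts_list)
    return entropy(parent_counts) - info_a

# Illustrative distribution: 9 positive and 5 negative tuples, split by a
# hypothetical attribute into partitions with counts (2,3), (4,0), (3,2).
print(entropy([9, 5]))                                       # ≈ 0.940 bits
print(information_gain([9, 5], [[2, 3], [4, 0], [3, 2]]))    # ≈ 0.247 bits
```

The attribute with the largest such gain would be chosen as the splitting attribute at this node.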
[Table: training dataset with attributes RID, Age, Income, Student, Credit_rating and class label buys_computer]

Tree Pruning
• Prepruning
• The tree is pruned by halting its construction early.
• E.g.: decide not to further split the subset at a given node; in that case the current node becomes a leaf.
• A threshold is set based on measures such as statistical significance, information gain, or the Gini index.
• Postpruning
• Remove subtrees from a fully grown tree
• The most common approach
Why Are Decision Trees Popular?
• Do not require any domain knowledge to construct a decision tree
Problems With Decision Trees
• Repetition and replication of tree branches lead to large trees
Statistical Classification
Naïve Bayes Classification
• Algorithm based on the Bayes theorem
• Bayes' Theorem
• The conditional probability of event C occurring, given that event A has already occurred:
P(C|A) = P(A|C) · P(C) / P(A)
Naïve Bayes Classification
• Predicts that tuple X belongs to class Cᵢ if and only if P(Cᵢ|X) > P(Cⱼ|X) for all j ≠ i
• How to maximize P(Cᵢ|X)? By Bayes' theorem,
P(Cᵢ|X) = P(X|Cᵢ) · P(Cᵢ) / P(X)
and since P(X) is the same for every class, it is enough to maximize P(X|Cᵢ) · P(Cᵢ)
[Table: training dataset with attributes RID, Age, Income, Student, Credit_rating and class label buys_computer, used for a Naïve Bayes worked example]
Naïve Bayes Classification
Text Tag
“A great game” Sports
“The election was over” Not sports
“Very clean match” Sports
“A clean but forgettable game” Sports
“It was a close election” Not sports
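The text-classification example above can be worked end to end with a small multinomial Naïve Bayes sketch. The Laplace (add-one) smoothing and the test sentence are illustrative choices, not part of the slides.

```python
from collections import Counter
from math import log

train = [
    ("A great game", "Sports"),
    ("The election was over", "Not sports"),
    ("Very clean match", "Sports"),
    ("A clean but forgettable game", "Sports"),
    ("It was a close election", "Not sports"),
]

# Per-tag word counts, per-tag document counts, and the overall vocabulary
word_counts = {}
doc_counts = Counter()
vocab = set()
for text, tag in train:
    doc_counts[tag] += 1
    counts = word_counts.setdefault(tag, Counter())
    for word in text.lower().split():
        counts[word] += 1
        vocab.add(word)

def classify(text):
    """Pick the tag maximizing log P(tag) + sum of log P(word|tag)."""
    best_tag, best_score = None, float("-inf")
    for tag, counts in word_counts.items():
        score = log(doc_counts[tag] / len(train))  # log prior P(C_i)
        total = sum(counts.values())
        for word in text.lower().split():
            # Laplace smoothing so unseen words do not zero out the product
            score += log((counts[word] + 1) / (total + len(vocab)))
        if score > best_score:
            best_tag, best_score = tag, score
    return best_tag

print(classify("a very close game"))  # → Sports
```

Working in log space avoids underflow when many word probabilities are multiplied, and dropping P(X) is safe because it is identical for both tags.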
Rule-based Classification
Rule-based Classification
• Use IF-THEN rules for classification
• IF age = youth AND student = yes THEN buys_computer = yes
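An IF-THEN rule set like the one above can be applied as an ordered list of condition/conclusion pairs. The second rule, the default class, and the function names are illustrative assumptions.

```python
# Each rule: (conditions that must all hold, predicted class label)
rules = [
    ({"age": "youth", "student": "yes"}, "yes"),
    ({"age": "senior", "credit_rating": "fair"}, "no"),  # hypothetical second rule
]

def classify(tuple_, default="no"):
    """Fire the first rule whose antecedent matches the tuple; fall back to a default class."""
    for conditions, label in rules:
        if all(tuple_.get(attr) == value for attr, value in conditions.items()):
            return label
    return default

print(classify({"age": "youth", "student": "yes", "income": "high"}))  # → yes
```

Ordering matters here: when rules overlap, the first matching rule wins, which is one common conflict-resolution strategy for rule-based classifiers.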
Evaluate Classifier Performance
Outcome of the Classification
• True Positive (TP)
• Positive tuples that were correctly labeled by the classifier
• Error Rate
• The proportion of misclassified tuples: (FP + FN) / (P + N)
Evaluate the Outcome
• Sensitivity
• True positive rate (recognition rate): TP / P
• Specificity
• True negative rate: TN / N
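These rates fall straight out of the confusion-matrix counts. The counts in the example call below are illustrative assumptions.

```python
def rates(tp, fn, tn, fp):
    """Sensitivity = TP / P, specificity = TN / N, error rate = (FP + FN) / (P + N)."""
    p = tp + fn          # all actually-positive tuples
    n = tn + fp          # all actually-negative tuples
    sensitivity = tp / p
    specificity = tn / n
    error_rate = (fp + fn) / (p + n)
    return sensitivity, specificity, error_rate

# Hypothetical outcome: 90 TP, 10 FN, 80 TN, 20 FP
print(rates(90, 10, 80, 20))  # → (0.9, 0.8, 0.15)
```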
Evaluate the Outcome
• What can we tell about the classification results below?