
Lecture 2.3

• Error Analysis
• Train/Test Split, validation set
• Confusion Matrix
• Accuracy, Precision, Recall, F-measure, ROC curve

Dr. Mainak Biswas


Train/Test Split in Machine Learning
• Train-test split is a machine learning technique
that divides a dataset into two subsets: a training
set and a testing set
• It's a model validation process that helps assess
how well a machine learning model will perform
on new data
• Typical Split Ratios (an 80/20 split is sketched in the code after this list)
– 80% for training and 20% for testing
– 70% for training and 30% for testing
– 90% for training and 10% for testing (for large dataset)
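A minimal sketch of an 80/20 split using scikit-learn; the synthetic dataset is only a stand-in for real data:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

# Synthetic data as a placeholder for a real dataset.
X, y = make_classification(n_samples=1000, n_features=10, random_state=0)

# 80% training / 20% testing; stratify keeps class proportions similar in both subsets.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42, stratify=y
)

print(X_train.shape, X_test.shape)  # (800, 10) (200, 10)
```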
Validation Set
• The validation set is an additional subset of the dataset
used to tune the model's hyper-parameters and evaluate
its performance during training
• It acts as an intermediary between the training set and the
test set
• Purpose of a Validation Set
– Hyperparameter Tuning
– Early Stopping
– Model Selection
• Train/Validation/Test Split (a code sketch follows this list)
– Training Set: Used to train the model
– Validation Set: Used to tune hyperparameters and evaluate the
model during training
– Test Set: Used to assess the final performance on unseen data
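One common way to obtain all three subsets is to call train_test_split twice; a minimal sketch assuming scikit-learn, with illustrative 60/20/20 proportions that are not taken from the slide:

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)

# First carve off the test set (20% of the data) ...
X_rest, X_test, y_rest, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# ... then split the remaining 80% into training and validation
# (0.25 of the remaining 80% = 20% of the original data).
X_train, X_val, y_train, y_val = train_test_split(X_rest, y_rest, test_size=0.25, random_state=42)

print(len(X_train), len(X_val), len(X_test))  # 600 200 200
```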
Confusion Matrix
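The confusion matrix tabulates predicted labels against actual labels, giving the counts of true positives (TP), false positives (FP), false negatives (FN) and true negatives (TN) used by the metrics below. A minimal sketch with scikit-learn and toy labels; note that for 0/1 labels confusion_matrix lays the result out as [[TN, FP], [FN, TP]]:

```python
from sklearn.metrics import confusion_matrix

y_true = [1, 1, 0, 1, 0, 0, 1, 0]   # actual labels (toy example)
y_pred = [1, 0, 0, 1, 1, 0, 1, 0]   # model predictions (toy example)

cm = confusion_matrix(y_true, y_pred)   # rows = actual class, columns = predicted class
tn, fp, fn, tp = cm.ravel()
print(cm)                               # [[3 1]
                                        #  [1 3]]
print("TP:", tp, "FP:", fp, "FN:", fn, "TN:", tn)
```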



Accuracy = (TP + TN) / (TP + TN + FP + FN)

Recall measures the proportion of correctly predicted positive
observations out of all actual positives.
True positive rate (TPR), recall, sensitivity (SEN) = TP / (TP + FN) = TP / P

Precision measures the proportion of correctly predicted positive
observations out of all predicted positives.
Precision = TP / (TP + FP)

The F-score (or F1-score) is a metric that combines precision and
recall into a single score, providing a balance between the two.
It's especially useful when the data is imbalanced.
F-Measure = (2 · Precision · Recall) / (Precision + Recall)

False Positive Rate (FPR) is a measure used in binary classification to quantify how
often a model incorrectly predicts a positive outcome for a negative instance.
False positive rate (FPR, type-I error) = FP / (FP + TN)
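These definitions can be checked directly from the four counts; the counts in the sketch below are placeholders, not numbers from the lecture:

```python
# Placeholder counts for illustration.
tp, tn, fp, fn = 40, 30, 10, 20

accuracy  = (tp + tn) / (tp + tn + fp + fn)                 # 0.70
recall    = tp / (tp + fn)                                  # true positive rate / sensitivity, ~0.67
precision = tp / (tp + fp)                                  # 0.80
f_measure = 2 * precision * recall / (precision + recall)   # ~0.73
fpr       = fp / (fp + tn)                                  # false positive rate (type-I error), 0.25

print(accuracy, recall, precision, f_measure, fpr)
```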
ROC Curve
• An ROC (Receiver Operating Characteristic) plot
is a graphical representation used to evaluate the
performance of a binary classification model
• It illustrates the trade-off between the True
Positive Rate (TPR) and the False Positive Rate
(FPR) at various threshold settings for a classifier.
Here's a breakdown of its meaning and
components
True positive rate (TPR), recall, sensitivity (SEN) = TP / (TP + FN) = TP / P

False positive rate (FPR, type-I error) = FP / (FP + TN)
Components of ROC Curve
• X-axis: False Positive Rate (FPR)
• Y-axis: True Positive Rate (TPR)
• Curve: Plots TPR against FPR for various threshold
values
• Diagonal Line: Represents a random classifier (no
predictive power)
– The area under this line is 0.5
• Area Under the Curve (AUC): The AUC score
measures the overall performance of the model (see the code sketch after this list)
– An AUC of 1.0 indicates a perfect classifier, while 0.5
indicates a model with no discriminative ability.
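A minimal sketch of generating the curve and its AUC with scikit-learn; the classifier and synthetic data are illustrative assumptions, not part of the lecture:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_curve, roc_auc_score

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
scores = model.predict_proba(X_test)[:, 1]        # probability of the positive class

fpr, tpr, thresholds = roc_curve(y_test, scores)  # one (FPR, TPR) point per threshold
print("AUC:", roc_auc_score(y_test, scores))      # 0.5 = random, 1.0 = perfect
# Plotting fpr on the x-axis against tpr on the y-axis gives the ROC curve.
```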
Confusion Matrix Generation
Predicted   True
    1         1
    1         1
    1         0
    1         1
    1         0
    1         0
    1         1
    1         1
    0         0
    0         0
    0         1
    1         1
    1         0
    0         0
    0         0
    1         1
    1         1
    1         0
    0         1
    0         0

                              Actually Positive (P) (1)   Actually Negative (N) (0)
Predicted Positive (PP) (1)           8 (TP)                      5 (FP)
Predicted Negative (PN) (0)           2 (FN)                      5 (TN)

False positive rate (type-I error) = FP / (FP + TN) = 5 / 10 = 0.5
Accuracy = (8 + 5) / (8 + 5 + 5 + 2) = 0.65
Recall, sensitivity (SEN) = TP / (TP + FN) = 8 / (8 + 2) = 0.8
Precision = TP / (TP + FP) = 8 / 13 = 0.62
F-Measure = (2 × 0.62 × 0.8) / (0.62 + 0.8) = 0.70
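The same numbers can be reproduced with scikit-learn on the 20 predicted/true pairs above (a sketch; before rounding, precision is about 0.615 and the F-measure about 0.696):

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# The 20 (predicted, true) pairs from the slide.
pred = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 1, 1, 0, 0]
true = [1, 1, 0, 1, 0, 0, 1, 1, 0, 0, 1, 1, 0, 0, 0, 1, 1, 0, 1, 0]

print(accuracy_score(true, pred))    # 0.65
print(recall_score(true, pred))      # 0.8
print(precision_score(true, pred))   # ~0.615 (0.62 on the slide)
print(f1_score(true, pred))          # ~0.696 (0.70 on the slide)
```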
ROC Generation
Predicted True
1 1
1 1
1 0
1 1
1 0
1 0
1 1
1 1
0 0
0 0
0 1
1 1
1 0
0 0
0 0
1 1
1 1
1 0
0 1
0 0
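With only hard 0/1 predictions, the ROC for this data reduces to a single operating point (TPR = 0.8, FPR = 0.5); tracing a full curve requires continuous classifier scores. A sketch that feeds the hard predictions to roc_curve, which then returns just the two corner points plus that single point:

```python
from sklearn.metrics import roc_curve, auc

pred = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 1, 1, 0, 0]  # predicted (slide data)
true = [1, 1, 0, 1, 0, 0, 1, 1, 0, 0, 1, 1, 0, 0, 0, 1, 1, 0, 1, 0]  # actual (slide data)

fpr, tpr, thresholds = roc_curve(true, pred)
print(list(zip(fpr, tpr)))     # [(0.0, 0.0), (0.5, 0.8), (1.0, 1.0)]
print("AUC:", auc(fpr, tpr))   # 0.65; with real scores, more thresholds would fill out the curve
```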

