Exp7_MLAI2
07
To evaluate the performance or quality of the model, different metrics are used, and these
metrics are known as performance metrics or evaluation metrics.
In a classification problem, the category or class of the data is identified based on the training
data. The model learns from the given dataset and then classifies the new data into classes
or groups based on the training. It predicts class labels as the output, such as Yes or No, 0 or
1, Spam or Not Spam, etc. To evaluate the performance of a classification model, different
metrics are used, and some of them are as follows:
o Accuracy
o Confusion Matrix
o Precision
o Recall
o F1-Score
o AUC-ROC (Area Under the Curve)
1.Accuracy
The accuracy metric is one of the simplest classification metrics to implement. It is the
ratio of the number of correct predictions to the total number of predictions.
It can be formulated as:
Accuracy = Number of correct predictions / Total number of predictions
To implement an accuracy metric, we can compare ground truth and predicted values in a
loop, or we can also use the scikit-learn module for this.
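For instance, a minimal sketch of the loop-based approach (the y_true and y_pred labels below are made-up values used only for illustration):

y_true = [0, 1, 1, 0, 1, 1, 0, 0, 1, 1]   # ground-truth labels
y_pred = [0, 1, 0, 0, 1, 1, 0, 1, 1, 0]   # model predictions

# Count how many predictions match the ground truth
correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
accuracy = correct / len(y_true)
print("Accuracy:", accuracy)   # prints: Accuracy: 0.7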
2.Confusion Matrix :
A confusion matrix is a tabular representation of prediction outcomes of any binary
classifier, which is used to describe the performance of the classification model on a set of
test data when true values are known.
A confusion matrix is a fundamental tool for evaluating the performance of a classification
model. It provides a summary of the prediction results on a classification problem, showing
how well the model's predictions match the actual labels. The matrix layout enables easy
identification of the types of errors the model is making.
True Positive (TP): The model correctly predicts the positive class.
True Negative (TN): The model correctly predicts the negative class.
False Positive (FP): The model incorrectly predicts the positive class when it's actually
negative (also known as a "Type I error").
False Negative (FN): The model incorrectly predicts the negative class when it's actually
positive (also known as a "Type II error").
Accuracy: using the values in the confusion matrix, accuracy can be computed as
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Example :
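A minimal sketch, assuming scikit-learn is installed; the y_true and y_pred labels are made-up values used only for illustration. It builds the confusion matrix and computes accuracy from it; the printed result is shown under Output below.

from sklearn.metrics import confusion_matrix, accuracy_score

y_true = [0, 1, 1, 0, 1, 1, 0, 0, 1, 1]   # actual labels
y_pred = [0, 1, 0, 0, 1, 1, 0, 1, 1, 0]   # predicted labels

cm = confusion_matrix(y_true, y_pred)     # rows = actual class, columns = predicted class
print("Confusion Matrix:")
print(cm)
print("Accuracy:", accuracy_score(y_true, y_pred))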
Output:
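Confusion Matrix:
[[3 1]
 [2 4]]
Accuracy: 0.7
Here the first row corresponds to the actual negative class (TN = 3, FP = 1) and the second row to the actual positive class (FN = 2, TP = 4).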
Precision :
The precision metric is used to overcome the limitation of accuracy. Precision
determines the proportion of positive predictions that were actually correct. It is
calculated as the ratio of true positives to the total number of positive predictions
(true positives plus false positives):
Precision = TP / (TP + FP)
Example :
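A minimal sketch using scikit-learn's precision_score on the same illustrative labels; its printed result is shown under Output below.

from sklearn.metrics import precision_score

y_true = [0, 1, 1, 0, 1, 1, 0, 0, 1, 1]
y_pred = [0, 1, 0, 0, 1, 1, 0, 1, 1, 0]

# Precision = TP / (TP + FP) = 4 / (4 + 1)
print("Precision:", precision_score(y_true, y_pred))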
Output :
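Precision: 0.8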
Recall or Sensitivity :
It is similar to the Precision metric; however, it measures the proportion of actual
positives that were identified correctly. It is calculated as the ratio of true positives to
the total number of actual positives, i.e., samples either correctly predicted as positive
or incorrectly predicted as negative (true positives plus false negatives):
Recall = TP / (TP + FN)
Example :
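A minimal sketch using scikit-learn's recall_score on the same illustrative labels; its printed result is shown under Output below.

from sklearn.metrics import recall_score

y_true = [0, 1, 1, 0, 1, 1, 0, 0, 1, 1]
y_pred = [0, 1, 0, 0, 1, 1, 0, 1, 1, 0]

# Recall = TP / (TP + FN) = 4 / (4 + 2)
print("Recall:", round(recall_score(y_true, y_pred), 4))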
Output :
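Recall: 0.6667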
F1-Score :
The F-score or F1-Score is a metric to evaluate a binary classification model on the basis
of predictions that are made for the positive class. It is calculated with the help of
Precision and Recall and provides a single score that represents both. The F1-Score is
the harmonic mean of Precision and Recall, assigning equal weight to each of them:
F1-Score = 2 * (Precision * Recall) / (Precision + Recall)
Example :
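A minimal sketch using scikit-learn's f1_score on the same illustrative labels; its printed result is shown under Output below.

from sklearn.metrics import f1_score

y_true = [0, 1, 1, 0, 1, 1, 0, 0, 1, 1]
y_pred = [0, 1, 0, 0, 1, 1, 0, 1, 1, 0]

# F1 = 2 * (Precision * Recall) / (Precision + Recall)
# With Precision = 0.8 and Recall = 0.6667, F1 is roughly 0.7273
print("F1-Score:", round(f1_score(y_true, y_pred), 4))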
Output :
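F1-Score: 0.7273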
AUC-ROC :
Sometimes we need to visualize the performance of the classification model on charts;
then, we can use the AUC-ROC curve. It is one of the popular and important metrics for
evaluating the performance of the classification model.
Firstly, let's understand the ROC (Receiver Operating Characteristic) curve. ROC is a
graph that shows the performance of a classification model at different threshold
levels. The curve is plotted between two parameters, which are:
o True Positive Rate (TPR)
o False Positive Rate (FPR)
TPR, or True Positive Rate, is a synonym for Recall and can be calculated as
TPR = TP / (TP + FN), while FPR, or False Positive Rate, is calculated as
FPR = FP / (FP + TN).
To calculate the value at every point on a ROC curve, we could evaluate the classification
model multiple times with different classification thresholds, but this would not be very
efficient. So, for this, an efficient method is used, which is known as AUC.
AUC stands for Area Under the ROC Curve. As its name suggests, AUC measures the
two-dimensional area under the entire ROC curve.
AUC evaluates the performance across all possible thresholds and provides an aggregate
measure. The value of AUC ranges from 0 to 1: a model whose predictions are 100%
wrong has an AUC of 0.0, whereas a model whose predictions are 100% correct has an
AUC of 1.0.
Example :
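A minimal sketch, assuming scikit-learn and matplotlib are installed. The predicted probabilities in y_score are made-up values; in practice they would come from a trained classifier, for example model.predict_proba(X_test)[:, 1]. The sketch computes the ROC curve points, plots the curve, and reports the AUC.

from sklearn.metrics import roc_curve, roc_auc_score
import matplotlib.pyplot as plt

y_true  = [0, 1, 1, 0, 1, 1, 0, 0, 1, 1]                         # actual labels
y_score = [0.2, 0.9, 0.4, 0.3, 0.8, 0.7, 0.1, 0.6, 0.75, 0.35]   # predicted probabilities of class 1

# TPR and FPR at each candidate threshold
fpr, tpr, thresholds = roc_curve(y_true, y_score)

# Aggregate performance across all thresholds
print("AUC:", roc_auc_score(y_true, y_score))

# Plot the ROC curve against the diagonal of a random classifier
plt.plot(fpr, tpr, label="ROC curve")
plt.plot([0, 1], [0, 1], linestyle="--", label="Random classifier")
plt.xlabel("False Positive Rate")
plt.ylabel("True Positive Rate")
plt.title("ROC Curve")
plt.legend()
plt.show()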