🎯 Why Are These Metrics Important?
When we build a classification model, we need a way to measure how well it performs — especially
when the data is imbalanced (e.g., 90% healthy, 10% sick).
💡 Confusion Matrix (The Base of Everything)
A confusion matrix is a summary table for classification results:
```
                Predicted
                 0      1
              ----------------
Actual   0  |   TN   |   FP
         1  |   FN   |   TP
```
Where:
TP (True Positive): Correctly predicted positive class
TN (True Negative): Correctly predicted negative class
FP (False Positive): Incorrectly predicted positive (Type I error)
FN (False Negative): Incorrectly predicted negative (Type II error)
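As a quick sketch of how these four counts map onto scikit-learn's output (using the same toy labels as the examples below), `confusion_matrix(...).ravel()` hands them back directly:
```python
from sklearn.metrics import confusion_matrix

# Same toy labels used in the examples below
y_true = [1, 0, 1, 1, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1]

# scikit-learn lays the 2x2 matrix out as [[TN, FP], [FN, TP]],
# so ravel() returns the counts in that order
TN, FP, FN, TP = confusion_matrix(y_true, y_pred).ravel()
print("TP:", TP, "TN:", TN, "FP:", FP, "FN:", FN)  # TP: 3 TN: 2 FP: 1 FN: 1
```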
✅ Accuracy
Definition:
Percentage of total predictions that are correct.
Accuracy = (TP + TN) / (TP + TN + FP + FN)
Use-case:
Good for balanced datasets.
Weakness:
Can be misleading for imbalanced data.
Example:
```python
from sklearn.metrics import accuracy_score

y_true = [1, 0, 1, 1, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1]

print("Accuracy:", accuracy_score(y_true, y_pred))
```
Output:
```
Accuracy: 0.714 (5 out of 7 predictions are correct)
```
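To see that weakness concretely, here is a small sketch with made-up imbalanced labels: a model that always predicts the majority class still reaches 90% accuracy while catching none of the positives.
```python
from sklearn.metrics import accuracy_score, recall_score

# Hypothetical imbalanced data: 9 negatives, 1 positive
y_true_imb = [0] * 9 + [1]
# A "lazy" model that always predicts the majority class
y_pred_imb = [0] * 10

print("Accuracy:", accuracy_score(y_true_imb, y_pred_imb))  # 0.9 -- looks impressive
print("Recall:", recall_score(y_true_imb, y_pred_imb))      # 0.0 -- misses the only positive
```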
🎯 Precision
Definition:
Of all predicted positives, how many were actually positive?
Precision = TP / (TP + FP)
Use-case:
Important when false positives are costly (e.g., spam filtering, where a false positive sends a legitimate email to the spam folder).
Example:
```python
from sklearn.metrics import precision_score

print("Precision:", precision_score(y_true, y_pred))
```
Output:
```
Precision: 0.75 (3 correct positives out of 4 predicted positives)
```
🎯 Recall (Sensitivity or True Positive Rate)
Definition:
Of all actual positives, how many were correctly predicted?
Recall = TP / (TP + FN)
Use-case:
Important when missing positives is dangerous (e.g., detecting fraud or cancer).
Example:
```python
from sklearn.metrics import recall_score

print("Recall:", recall_score(y_true, y_pred))
```
Output:
```
Recall: 0.75 (3 out of 4 actual positives were caught)
```
🎯 F1-Score
Definition:
Harmonic mean of Precision and Recall — a balanced metric.
F1 = 2 × (Precision × Recall) / (Precision + Recall)
Use-case:
When you want to balance precision and recall, especially on imbalanced datasets.
Example:
```python
from sklearn.metrics import f1_score

print("F1-Score:", f1_score(y_true, y_pred))
```
Output:
```
F1-Score: 0.75
```
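Why the harmonic mean instead of a simple average? A quick worked example (numbers chosen only for illustration): with Precision = 0.9 and Recall = 0.1, the arithmetic mean would be 0.5, but F1 = 2 × (0.9 × 0.1) / (0.9 + 0.1) = 0.18. The harmonic mean punishes large gaps between the two, so a model cannot hide a terrible recall behind an excellent precision.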
🧮 Full Breakdown with Confusion Matrix
Let's compute the metrics manually to understand them better:
```python
from sklearn.metrics import confusion_matrix

cm = confusion_matrix(y_true, y_pred)
print("Confusion Matrix:\n", cm)
```
Output:
```
[[2 1]
 [1 3]]
```
From this:
TP = 3, TN = 2, FP = 1, FN = 1
Now manually:
```python
TP = 3
TN = 2
FP = 1
FN = 1

accuracy = (TP + TN) / (TP + TN + FP + FN)
precision = TP / (TP + FP)
recall = TP / (TP + FN)
f1 = 2 * (precision * recall) / (precision + recall)

print("Manual Accuracy:", accuracy)
print("Manual Precision:", precision)
print("Manual Recall:", recall)
print("Manual F1-Score:", f1)
```
📊 Visualize Metrics (Bar Plot)
```python
import matplotlib.pyplot as plt

metrics = ['Accuracy', 'Precision', 'Recall', 'F1-Score']
values = [accuracy, precision, recall, f1]

plt.figure(figsize=(8, 5))
plt.bar(metrics, values, color='skyblue')
plt.ylim(0, 1)
plt.title('Performance Metrics')
plt.ylabel('Score')
plt.grid(True, linestyle='--', alpha=0.5)
plt.show()
```
✅ Summary Table
| Metric    | Best When...                             | Worst When...                        |
|-----------|------------------------------------------|--------------------------------------|
| Accuracy  | Data is balanced                         | Data is imbalanced                   |
| Precision | False positives are costly               | You need to catch all positives      |
| Recall    | False negatives are costly               | False positives are also costly      |
| F1-Score  | Need balance between precision & recall  | You care only about one (P or R)     |
📌 Bonus: Classification Report
Scikit-learn gives all metrics per class:
```python
from sklearn.metrics import classification_report

print(classification_report(y_true, y_pred, target_names=['Class 0', 'Class 1']))
```
Output:
```
              precision    recall  f1-score   support

     Class 0       0.67      0.67      0.67         3
     Class 1       0.75      0.75      0.75         4

    accuracy                           0.71         7
   macro avg       0.71      0.71      0.71         7
weighted avg       0.71      0.71      0.71         7
```
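If you need these numbers programmatically rather than as printed text, classification_report also accepts output_dict=True, which returns a nested dictionary (a small sketch; the per-class keys follow the target_names passed in):
```python
from sklearn.metrics import classification_report

# output_dict=True returns a nested dict instead of a formatted string
report = classification_report(y_true, y_pred,
                               target_names=['Class 0', 'Class 1'],
                               output_dict=True)

# e.g., pull out the recall for the positive class
print("Class 1 recall:", report['Class 1']['recall'])
```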
Would you like the same deep explanation for macro/micro/weighted averaging, ROC AUC, or how to
use these metrics with multiclass or multilabel classification?