
Unit 3 ML

1. Write the Differences Between Classification Trees and Regression Trees.

Decision Trees are used in machine learning for both classification and regression problems.
Depending on the type of target variable (categorical or continuous), the tree is classified as a
Classification Tree or a Regression Tree.

Differences Between Classification and Regression Trees

| Feature | Classification Tree | Regression Tree |
|---|---|---|
| Target Variable | Categorical (discrete labels) | Continuous (numerical values) |
| Objective | Classifies input data into predefined categories | Predicts a continuous numerical outcome |
| Splitting Criteria | Gini Index, Entropy (Information Gain), or Gain Ratio | Mean Squared Error (MSE) or Variance Reduction |
| Prediction Output | The most common class in a region (majority vote) | The mean or median value in a region |
| Example 1 | Email classification (Spam/Not Spam) | House price prediction |
| Example 2 | Customer segmentation (High/Medium/Low risk) | Predicting sales revenue |
| Evaluation Metric | Accuracy, Precision, Recall, F1-score | Mean Squared Error (MSE), R² score |
| Handling Outliers | Less sensitive to outliers | Highly sensitive to outliers |
| Tree Structure | Often deeper with multiple branches | More compact and pruned |

Conclusion:

 If the problem involves predicting a category, use a classification tree.

 If the problem involves predicting a numerical value, use a regression tree.
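To make the distinction concrete, here is a minimal scikit-learn sketch of the two tree types (the toy features and targets are invented for illustration):

```python
# Minimal sketch: the two tree types share one API in scikit-learn.
from sklearn.tree import DecisionTreeClassifier, DecisionTreeRegressor

X = [[0, 0], [1, 1], [2, 2], [3, 3]]  # toy feature matrix

# Classification tree: categorical target, Gini/entropy splitting
clf = DecisionTreeClassifier(criterion="gini")
clf.fit(X, ["spam", "spam", "not spam", "not spam"])
print(clf.predict([[0, 1]]))  # -> a class label, e.g. ['spam']

# Regression tree: continuous target, squared-error (MSE) splitting
reg = DecisionTreeRegressor(criterion="squared_error")
reg.fit(X, [100.0, 150.0, 200.0, 250.0])
print(reg.predict([[0, 1]]))  # -> a numeric value
```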

2. Write a Short Note on Information Gain.

Definition:

Information Gain (IG) is a metric used in Decision Trees to determine the best feature to split the
data. It measures the reduction in uncertainty (entropy) after splitting the data based on an
attribute.

Formula:

Information Gain(S, A) = Entropy(S) − Σᵥ (|Sᵥ| / |S|) × Entropy(Sᵥ)

where the sum runs over the values v of attribute A, and Sᵥ is the subset of S for which A = v.

Example:

Suppose we have a dataset of weather conditions where we predict whether a person will play
tennis (Yes/No) based on outlook (Sunny, Rainy, Overcast).

1. Calculate Entropy (Before Split).

2. Split data based on Outlook.

3. Compute Weighted Entropy after split.

4. Information Gain = Entropy Before - Entropy After

The attribute with the highest Information Gain is chosen for the split.
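A small self-contained sketch of these four steps in Python (the helper functions and the example split are illustrative, not from the PPT):

```python
import math

def entropy(labels):
    """Entropy(S) = -sum(p_i * log2(p_i)) over the classes in `labels`."""
    n = len(labels)
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def information_gain(parent, subsets):
    """IG = Entropy(before split) - weighted Entropy(after split)."""
    n = len(parent)
    after = sum(len(s) / n * entropy(s) for s in subsets)
    return entropy(parent) - after

# 10 play-tennis labels split into two groups by a hypothetical Outlook value
parent = ["Yes"] * 6 + ["No"] * 4
subsets = [["Yes", "Yes", "Yes", "No"],
           ["Yes", "Yes", "Yes", "No", "No", "No"]]
print(information_gain(parent, subsets))  # ~0.046: a weak split
```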

Importance:

✔ Helps in selecting the best feature for decision-making.


✔ Ensures that the tree grows efficiently with less depth.

3. Write a Short Note on Entropy.

Definition:

Entropy is a measure of randomness or impurity in a dataset. It determines how homogeneous or heterogeneous the data is.

Formula:

Entropy(S) = − Σᵢ₌₁ⁿ pᵢ log₂(pᵢ)

Where:

 pᵢ = probability of class i in dataset S

 n = number of unique classes

Example:

Consider a dataset with 10 instances:

 6 instances belong to Class A (p_A = 6/10 = 0.6).

 4 instances belong to Class B (p_B = 4/10 = 0.4).

Entropy(S) = −(0.6 log₂ 0.6 + 0.4 log₂ 0.4) = 0.442 + 0.529 = 0.971

Interpretation:

✔ High Entropy (near 1): Data is impure (equal distribution of classes).


✔ Low Entropy (near 0): Data is pure (mostly one class).

Use in Decision Trees:

 Goal: Reduce entropy at each step by splitting the dataset on the best attribute.

 Lower entropy means better classification performance.
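Assuming SciPy is available, the 6-vs-4 example above can be checked in one line:

```python
from scipy.stats import entropy

# Class probabilities for 6 instances of A and 4 of B
print(entropy([0.6, 0.4], base=2))  # ~0.971
```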

4. Write a Short Note on Gini Index.

Definition:

The Gini Index (also called Gini Impurity) is another metric used for measuring impurity in a dataset.
It determines how mixed the classes are within a node.

Formula:

Gini(S) = 1 − Σᵢ₌₁ⁿ pᵢ²

Where:

 pᵢ = proportion of class i in dataset S

Example:

For a dataset with two classes (A and B):

 6 instances belong to Class A (p_A = 0.6).

 4 instances belong to Class B (p_B = 0.4).

Gini(S) = 1 − (0.6² + 0.4²) = 1 − (0.36 + 0.16) = 0.48

Interpretation:

✔ Gini = 0: Perfectly pure node (only one class).


✔ Gini = 0.5: Maximally impure for a two-class problem (equal class distribution); in general the maximum is 1 − 1/n for n classes, approaching 1 only as n grows.

Use in Decision Trees:

 Used in CART (Classification and Regression Trees) algorithm.

 Preferred over entropy for faster calculations.
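A minimal sketch of the computation (the helper function is our own):

```python
def gini(labels):
    """Gini(S) = 1 - sum(p_i^2) over the classes present in `labels`."""
    n = len(labels)
    counts = {}
    for y in labels:
        counts[y] = counts.get(y, 0) + 1
    return 1.0 - sum((c / n) ** 2 for c in counts.values())

print(round(gini(["A"] * 6 + ["B"] * 4), 2))  # 0.48, matching the worked example
```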

5. Write a Short Note on Gain Ratio.


Definition:

Gain Ratio is an improved version of Information Gain that penalizes attributes with many unique
values to prevent bias.

Formula:

Gain Ratio(S, A) = Information Gain(S, A) / Split Information(S, A)

Where:

 Information Gain: Measures reduction in entropy.

 Split Information: Measures how evenly the data is divided across the attribute's values; Split Information(S, A) = − Σᵥ (|Sᵥ| / |S|) log₂(|Sᵥ| / |S|).

Example:

Consider two attributes for classification:

1. "Color" (Red, Green, Blue) → High Information Gain but many unique values.

2. "Size" (Small, Large) → Balanced split with fewer categories.

Gain Ratio prefers "Size" as it provides a more meaningful split.

Use in Decision Trees:

✔ Used in C4.5 Decision Tree Algorithm to improve decision-making.


✔ Helps avoid bias toward attributes with more unique values.
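A short sketch of the Split Information penalty (the function and the example branch sizes are illustrative):

```python
import math

def split_information(branch_sizes):
    """SplitInfo = -sum((|Sv|/|S|) * log2(|Sv|/|S|)) over a split's branches."""
    n = sum(branch_sizes)
    return -sum((s / n) * math.log2(s / n) for s in branch_sizes if s > 0)

# A 3-way "Color" split fragments 10 rows more than a 2-way "Size" split,
# so its larger denominator shrinks the Gain Ratio even for similar raw IG.
print(split_information([4, 3, 3]))  # ~1.571 (Color: Red/Green/Blue)
print(split_information([5, 5]))     # 1.0    (Size: Small/Large)
```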

6. Solve All Numerical Problems Given in the PPT with Steps.

The numerical problems in the PPT involve:

1. Calculating Entropy

2. Computing Information Gain

3. Finding Gini Index

4. Applying Gain Ratio Formula

Each problem is solved step by step below using the formulas above.

Numerical 1: Calculating Entropy

Problem:
A dataset contains 6 instances of Class A and 4 instances of Class B. Compute Entropy.

Solution:

p_A = 6/10 = 0.6, p_B = 4/10 = 0.4

Entropy = −(0.6 log₂ 0.6 + 0.4 log₂ 0.4) = 0.442 + 0.529 = 0.971

Final Answer: Entropy = 0.971

Numerical 2: Computing Gini Index

Problem:
A dataset has 6 instances of Class A and 4 instances of Class B. Compute Gini Index.

Solution:

p_A = 0.6, p_B = 0.4

Gini = 1 − (0.6² + 0.4²) = 1 − 0.52 = 0.48

Final Answer: Gini Index = 0.48

Numerical 3: Computing Information Gain

Problem:
Given a dataset with 10 instances, split into two groups:

 Subset 1: 4 instances (3 A, 1 B)

 Subset 2: 6 instances (3 A, 3 B)
Calculate Information Gain.
Solution:

Entropy(parent) = −(0.6 log₂ 0.6 + 0.4 log₂ 0.4) = 0.971 (6 A and 4 B overall)

Entropy(Subset 1) = −(0.75 log₂ 0.75 + 0.25 log₂ 0.25) = 0.811

Entropy(Subset 2) = −(0.5 log₂ 0.5 + 0.5 log₂ 0.5) = 1.000

Weighted Entropy After Split = (4/10)(0.811) + (6/10)(1.000) = 0.324 + 0.600 = 0.924

Information Gain = 0.971 − 0.924 = 0.047

Final Answer: Information Gain = 0.047

Numerical 4: Gain Ratio Calculation

Problem:
Compute Gain Ratio given:

 Information Gain = 0.047

 Split Info = 0.8


Solution:

Gain Ratio = Information Gain / Split Information = 0.047 / 0.8 = 0.05875

Final Answer: Gain Ratio = 0.05875
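As a cross-check, all four answers can be reproduced with a short script (the helper is our own; note that Numerical 3 gives 0.046 before the intermediate rounding used above):

```python
import math

def entropy(probs):
    """Shannon entropy in bits from a list of class probabilities."""
    return -sum(p * math.log2(p) for p in probs if p > 0)

e_parent = entropy([0.6, 0.4])
print(round(e_parent, 3))                # Numerical 1: 0.971

print(round(1 - (0.6**2 + 0.4**2), 2))   # Numerical 2: 0.48

e_after = 0.4 * entropy([0.75, 0.25]) + 0.6 * entropy([0.5, 0.5])
print(round(e_parent - e_after, 3))      # Numerical 3: 0.046 (~0.047 stepwise)

print(round(0.047 / 0.8, 5))             # Numerical 4: 0.05875
```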

7. Write a Short Note on Decision Trees.

• Decision Tree is a Supervised learning technique that can be used for both classification and
Regression problems.

• It is a classification and prediction tool having a tree-like structure, where each internal node
denotes a test on an attribute, each branch represents an outcome of the test, and each leaf node
(terminal node) holds a class label.

• The goal of using a Decision Tree is to create a training model that can be used to predict the class or value of the target variable by learning simple decision rules inferred from prior data (training data).
