Decision Trees
A Decision Tree is a machine learning model represented as a flowchart-like structure, where each
internal node tests a feature (attribute), each branch corresponds to an outcome of that test, and
each leaf node holds a final decision. The tree is structured in a way that helps decision-making
based on the features (attributes) of the data: the path from the root to a leaf node defines a
decision rule that predicts the class or value for a given set of features.
How a Decision Tree Works:
1. Start at the Root Node: The root node represents the entire dataset. We begin by selecting a
feature (attribute) that best splits the data into different classes or outcomes. This split is
determined by specific criteria like Gini Impurity, Information Gain, or Variance Reduction.
2. Split Data Based on Features: At each internal node, the dataset is split based on the feature
that provides the best separation between classes or predicts the target value the best.
3. Continue Splitting: This process continues recursively at each internal node until we reach the
leaf nodes. These leaf nodes hold the final decision (class label for classification or value for
regression).
4. Make a Prediction: For new, unseen data, the prediction is made by following the tree structure
from the root to a leaf node, applying the decisions (tests) along the way.
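As a minimal illustration of this root-to-leaf procedure, the following sketch (assuming scikit-learn is available) trains a tree and predicts for new samples; the iris dataset here simply stands in for any dataset of features and class labels.

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

# criterion="entropy" chooses splits by Information Gain;
# criterion="gini" (the default) would use Gini Impurity instead.
clf = DecisionTreeClassifier(criterion="entropy", random_state=0)
clf.fit(X, y)  # splits recursively until leaves are pure (or a limit is hit)

# Prediction: each new sample is routed from the root to a leaf
# by applying the learned feature tests along the way.
print(clf.predict(X[:2]))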
Decision Rules:
Definition: A decision rule is a simple "if-then" condition derived from the decision tree.
Example: Consider a decision tree for classifying whether someone will buy a product based on
their age and income:
o If Age ≤ 30 and Income > 50,000, then "Buy Product" (Class 1).
o If Age > 30 and Income ≤ 50,000, then "Don't Buy Product" (Class 0).
These rules are extracted from the paths leading to the leaf nodes in the decision tree.
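These if-then rules can be read as ordinary conditional logic. Below is a hedged sketch that hard-codes the two example rules above as a plain function; the thresholds come from the example, not from any fitted model, and the remaining combinations of Age and Income are left undefined because the example does not cover them.

def will_buy(age, income):
    # Rule 1: If Age <= 30 and Income > 50,000 -> "Buy Product" (Class 1)
    if age <= 30 and income > 50_000:
        return 1
    # Rule 2: If Age > 30 and Income <= 50,000 -> "Don't Buy Product" (Class 0)
    if age > 30 and income <= 50_000:
        return 0
    return None  # paths not covered by the two example rules

print(will_buy(25, 60_000))  # -> 1 ("Buy Product")
print(will_buy(40, 40_000))  # -> 0 ("Don't Buy Product")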
Let’s consider a small example to illustrate how a decision tree works for classification:
Problem:
Classify whether a person will play tennis based on the weather conditions (Outlook, Temperature,
Humidity, Wind).
Attributes: Outlook (Sunny, Overcast, Rain), Temperature (Hot, Mild, Cool), Humidity (High,
Low), Wind (Weak, Strong)
Target/Label: PlayTennis (Yes, No)
Dataset: a small table of daily observations with the columns Outlook, Temperature, Humidity,
Wind and the label PlayTennis.
1. Step 1: Select the Root Node: The root node is selected based on the feature that best splits
the data, using a criterion such as Information Gain. After calculating the Information Gain
for each attribute, we find that Outlook is the best feature to split on, as it has the highest
Information Gain (a small sketch of this calculation appears after these steps).
2. Step 2: Split Data: The tree branches into three based on the possible values of Outlook (Sunny,
Overcast, Rain).
3. Step 3: Continue Splitting: Now, for each of these branches, we further split based on the next
best feature (say, Humidity or Wind).
o For Sunny, the tree might split based on Humidity: If Humidity = High, predict "No"
(Leaf node), otherwise "Yes".
o For Rain, the tree might split based on Wind: If Wind = Weak, predict "Yes" (Leaf node),
otherwise "No".
4. Step 4: Reach Leaf Nodes: The decision tree will keep splitting until it reaches leaf nodes with a
predicted label.
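To make Step 1 concrete, here is a hedged sketch of how the Information Gain of a candidate split could be computed. The PlayTennis labels below are made up for illustration and are not the full dataset; the point is only the "entropy before minus weighted entropy after" calculation. The full tree for this example is drawn just below.

from collections import Counter
from math import log2

def entropy(labels):
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(parent_labels, child_label_groups):
    total = len(parent_labels)
    weighted = sum((len(g) / total) * entropy(g) for g in child_label_groups)
    return entropy(parent_labels) - weighted

# Hypothetical PlayTennis labels grouped by Outlook value (illustrative only).
sunny    = ["No", "No", "Yes"]
overcast = ["Yes", "Yes"]
rain     = ["Yes", "No", "Yes"]
parent   = sunny + overcast + rain

print(information_gain(parent, [sunny, overcast, rain]))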
                 Outlook
               /    |    \
          Sunny  Overcast  Rain
           /         |        \
      Humidity      Yes       Wind
       /    \                /    \
     High   Low           Weak   Strong
      |      |              |      |
      No    Yes            Yes     No
Advantages:
Easy to Interpret: The model is visual and intuitive, making it easy to explain to non-experts.
Minimal Data Preparation: Little data preprocessing is required (e.g., no need for
normalization or scaling).
Disadvantages:
Overfitting: Decision trees can easily overfit to training data, especially with deep trees.
Instability: Small changes in the data can result in a completely different tree.
Bias toward Dominant Classes: Decision trees can be biased if the dataset is imbalanced.
Pruning:
Pruning is the process of reducing the size of a decision tree to prevent overfitting and improve
generalization.
Pre-Pruning: Stop the tree's growth early based on conditions like maximum depth or
minimum data at a node.
Post-Pruning: Grow the entire tree and then remove branches that do not improve
performance on a validation set.
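As a hedged sketch of both styles using scikit-learn (the parameter values are illustrative, not recommendations): pre-pruning is expressed through growth limits such as max_depth and min_samples_leaf, while post-pruning can be done with cost-complexity pruning, keeping the pruned tree that scores best on a held-out validation split.

from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)
X_train, X_val, y_train, y_val = train_test_split(X, y, random_state=0)

# Pre-pruning: stop growth early with depth / minimum-sample limits.
pre_pruned = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5,
                                    random_state=0).fit(X_train, y_train)

# Post-pruning: compute the cost-complexity pruning path of a fully grown
# tree, then keep the pruned tree that scores best on the validation split.
path = DecisionTreeClassifier(random_state=0).cost_complexity_pruning_path(
    X_train, y_train)
post_pruned = max(
    (DecisionTreeClassifier(ccp_alpha=a, random_state=0).fit(X_train, y_train)
     for a in path.ccp_alphas),
    key=lambda tree: tree.score(X_val, y_val),
)

print("pre-pruned :", pre_pruned.score(X_val, y_val))
print("post-pruned:", post_pruned.score(X_val, y_val))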
5. Decision Rules
Decision Rules are IF-THEN conditions derived from Decision Trees. For example, a rule extracted
from the tree above might look like: IF Outlook = Sunny AND Humidity = High THEN PlayTennis = No.
These rules provide a straightforward way to represent the tree’s logic, offering interpretability
and flexibility in practical applications.
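For instance, scikit-learn's export_text turns a fitted tree into exactly this kind of readable rule listing, one test per line along each root-to-leaf path; the sketch below uses the iris dataset purely as a stand-in.

from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
clf = DecisionTreeClassifier(max_depth=2, random_state=0)
clf.fit(iris.data, iris.target)

# Prints the tree as indented if-then rules ("|--- feature <= threshold" lines).
print(export_text(clf, feature_names=list(iris.feature_names)))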
6. Limitations of Decision Trees and Rules
1. Overfitting:
o Decision Trees can grow excessively, capturing noise in the training data.
o Pruning helps mitigate this but may lead to underfitting if over-pruned.
2. Bias Towards Dominant Features:
o Trees can favor features with many levels (e.g., ID numbers) or numeric features
with high variance.
3. Instability:
o Small changes in the training data can lead to entirely different tree structures.
4. Performance on Complex Relationships:
o Decision Trees struggle with datasets where features interact in complex, non-linear ways.
5. Scalability:
o For very large datasets, tree construction can become computationally expensive.
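As a hedged sketch of the overfitting limitation (point 1), using scikit-learn and the breast-cancer dataset purely as an example: an unrestricted tree typically scores perfectly on its own training split, and the gap to its test score is the overfitting being described; capping the depth is one simple mitigation.

from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

deep = DecisionTreeClassifier(random_state=0).fit(X_tr, y_tr)            # unrestricted depth
shallow = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_tr, y_tr)

print("deep    train/test accuracy:", deep.score(X_tr, y_tr), deep.score(X_te, y_te))
print("shallow train/test accuracy:", shallow.score(X_tr, y_tr), shallow.score(X_te, y_te))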