0% found this document useful (0 votes)

133 views

Hyperparameters Hyperparameters For Decision Trees: Maximum Depth

This document discusses hyperparameters that can be tuned in decision trees to help them generalize well to new problems. The key hyperparameters mentioned are: 1. Maximum depth, which controls the longest path from root to leaf and impacts the complexity of the tree. 2. Minimum samples to split, which is the minimum number of samples a node must have to be split further. 3. Minimum samples per leaf, which controls the minimum number of samples allowed in each leaf node to avoid leaves with very few samples.

Uploaded by

adityaacharya44

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

133 views

Hyperparameters Hyperparameters For Decision Trees: Maximum Depth

Uploaded by

adityaacharya44

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

Back to Home

16. Hyperparameters
Hyperparameters for Decision Trees
In order to create decision trees that will generalize to new problems well, we can
tune a number of different aspects about the trees. We call the different aspects of a
decision tree "hyperparameters". These are some of the most important
hyperparameters used in decision trees:

Maximum Depth

The maximum depth of a decision tree is simply the largest possible length between
the root to a leaf. A tree of maximum length
kkk
can have at most
2k2^k2k
leaves.

Maximum depth of a decision tree

Minimum number of samples to split

A node must have at least min_samples_split samples in order to be large enough to

split. If a node has fewer samples than min_samples_split samples, it will not be split,
and the splitting process stops.
Minimum number of samples to split

However, min_samples_split doesn't control the minimum size of leaves. As you can
see in the example on the right, above, the parent node had 20 samples, greater than
min_samples_split = 11, so the node was split. But when the node was split, a child
node was created with that had 5 samples, less than min_samples_split = 11.

Minimum number of samples per leaf

When splitting a node, one could run into the problem of having 99 samples in one
of them, and 1 on the other. This will not take us too far in our process, and would be
a waste of resources and time. If we want to avoid this, we can set a minimum for the
number of samples we allow on each leaf.
Minimum number of samples per leaf

This number can be specified as an integer or as a float. If it's an integer, it's the
minimum number of samples allowed in a leaf. If it's a float, it's the minimum
percentage of samples allowed in a leaf. For example, 0.1, or 10%, implies that a
particular split will not be allowed if one of the leaves that results contains less than
10% of the samples in the dataset.

If a threshold on a feature results in a leaf that has fewer samples than

min_samples_leaf, the algorithm will not allow that split, but it may perform a split on
the same feature at a different threshold, that does satisfy min_samples_leaf.

Overfitting Underfitting quiz

QUIZ QUESTION::

Let's test your intuition. Which sizes of features are associated with underfitting and
which with overfitting? Drag the answers to the corresponding boxes.

ANSWER CHOICES:

Small maximum depth

Large maximum depth

Small minimum samples per split

Large minimum samples per split

Feature Underfitting/Overfitting
Overfitting
Underfitting
Underfitting
Overfitting
SOLUTION:

Feature Underfitting/Overfitting
Overfitting Large maximum depth
Overfitting Small minimum samples per split
Underfitting Small maximum depth
Underfitting Large minimum samples per split
Underfitting Small maximum depth
Underfitting Large minimum samples per split
Overfitting Large maximum depth
Overfitting Small minimum samples per split
Next Concept

udacimak v1.4.0

Annotated Version of Philip Guo's 2018 NSF CAREER Proposal
No ratings yet
Annotated Version of Philip Guo's 2018 NSF CAREER Proposal
25 pages
Virgo: The Complete Book of
No ratings yet
Virgo: The Complete Book of
48 pages
TCS ION Certificate Courses - Opt
No ratings yet
TCS ION Certificate Courses - Opt
9 pages
Wells Fargo Egs (India) Private Limited
No ratings yet
Wells Fargo Egs (India) Private Limited
1 page
Decision Tree & Random Forest
No ratings yet
Decision Tree & Random Forest
16 pages
Underfitting and Overfitting
No ratings yet
Underfitting and Overfitting
5 pages
M2 Decision trees
No ratings yet
M2 Decision trees
37 pages
Decision Trees
No ratings yet
Decision Trees
5 pages
Decision Trees
No ratings yet
Decision Trees
11 pages
Decision Trees
No ratings yet
Decision Trees
37 pages
Decision Tree Algorithm
No ratings yet
Decision Tree Algorithm
14 pages
Decision Trees
No ratings yet
Decision Trees
8 pages
ESGB_2025_classification and regression tress [Enregistré automatiquement]
No ratings yet
ESGB_2025_classification and regression tress [Enregistré automatiquement]
43 pages
9-Module 5 Decision Tree-21-03-2024
No ratings yet
9-Module 5 Decision Tree-21-03-2024
83 pages
Machine Learning: Version 2 CSE IIT, Kharagpur
No ratings yet
Machine Learning: Version 2 CSE IIT, Kharagpur
6 pages
Lecture 7 Overview of ML models
No ratings yet
Lecture 7 Overview of ML models
77 pages
Lesson 36 - Rule Induction and Decision Tree II
No ratings yet
Lesson 36 - Rule Induction and Decision Tree II
6 pages
Random Forest Regression
No ratings yet
Random Forest Regression
57 pages
Decision Trees-Lecture 9&10
No ratings yet
Decision Trees-Lecture 9&10
60 pages
decision_trees_implementation (1)
No ratings yet
decision_trees_implementation (1)
13 pages
Hyperparameter tuning
No ratings yet
Hyperparameter tuning
4 pages
CSET301 LabW8L2
No ratings yet
CSET301 LabW8L2
1 page
19 -- Decision Tree -- ID3
No ratings yet
19 -- Decision Tree -- ID3
87 pages
Decision Tree
No ratings yet
Decision Tree
26 pages
Decision Trees
100% (1)
Decision Trees
61 pages
Decision Treesnotes
No ratings yet
Decision Treesnotes
3 pages
Random Forest: The Algorithm in A Nutshell
No ratings yet
Random Forest: The Algorithm in A Nutshell
10 pages
Decision Trees Cheat Sheet PDF
No ratings yet
Decision Trees Cheat Sheet PDF
2 pages
Optimized hyperparameters tuning of multi-class classification algorithms
No ratings yet
Optimized hyperparameters tuning of multi-class classification algorithms
17 pages
Decision Tree
No ratings yet
Decision Tree
28 pages
Classification
No ratings yet
Classification
8 pages
TEAA_ Tree Ensembles-1
No ratings yet
TEAA_ Tree Ensembles-1
43 pages
Hyperparametric Tuning of XG and RFC
No ratings yet
Hyperparametric Tuning of XG and RFC
2 pages
Unit 3
No ratings yet
Unit 3
31 pages
Week 2 Lecture Notes
No ratings yet
Week 2 Lecture Notes
61 pages
Decision Tree
No ratings yet
Decision Tree
54 pages
Lecture+Notes+-+Random Forests
No ratings yet
Lecture+Notes+-+Random Forests
10 pages
Dtree&rf
No ratings yet
Dtree&rf
26 pages
LVC+1+Post-Session+Summary
No ratings yet
LVC+1+Post-Session+Summary
9 pages
C4.5 and CHAID Algorithm: Pavan J Joshi 2010MCS2095 Special Topics in Database Systems
No ratings yet
C4.5 and CHAID Algorithm: Pavan J Joshi 2010MCS2095 Special Topics in Database Systems
30 pages
DM chapter 4
No ratings yet
DM chapter 4
6 pages
Decision Tree: Courtesy: Prof. Pabitra Mitra, CSE, IIT Kharagpur
No ratings yet
Decision Tree: Courtesy: Prof. Pabitra Mitra, CSE, IIT Kharagpur
73 pages
2 ML Ch3 Decision Trees Final
No ratings yet
2 ML Ch3 Decision Trees Final
70 pages
Decision Trees_ a Complete Introduction With Examples _ by Shubham Koli _ Medium
No ratings yet
Decision Trees_ a Complete Introduction With Examples _ by Shubham Koli _ Medium
22 pages
Lecture 04 Decession Trees 04112022 015118pm
No ratings yet
Lecture 04 Decession Trees 04112022 015118pm
43 pages
Data Mining Notes Unit 4
No ratings yet
Data Mining Notes Unit 4
30 pages
Decision Trees
No ratings yet
Decision Trees
15 pages
Overfitting Decision Trees
No ratings yet
Overfitting Decision Trees
68 pages
Tables
No ratings yet
Tables
10 pages
Decision Tree Comprehesive
No ratings yet
Decision Tree Comprehesive
7 pages
Tree
No ratings yet
Tree
7 pages
Decision Tree
No ratings yet
Decision Tree
58 pages
Lesson 5.0 Supervised Learning with Decision Trees (1)
No ratings yet
Lesson 5.0 Supervised Learning with Decision Trees (1)
16 pages
Hyperparameter_Tuning_in_Machine_Learning_1706249573
No ratings yet
Hyperparameter_Tuning_in_Machine_Learning_1706249573
9 pages
Apznzayn4iudcvxyoppqs61j04 7hfvwveb4orry3irmq7ekrlv08lh81olz64cb1ycwzmxuattzrg0ox0g-e Tcprei1i3bwhbnbqofqhvtixwokm0ftaoxwee3znpcytoh6jgknlof6 Rukjysosqdyan8wfbovpzrikmrpeywyu07ft Vvpsanuerxuhcghc7g6sd4pcyi9z-Wao8bn
No ratings yet
Apznzayn4iudcvxyoppqs61j04 7hfvwveb4orry3irmq7ekrlv08lh81olz64cb1ycwzmxuattzrg0ox0g-e Tcprei1i3bwhbnbqofqhvtixwokm0ftaoxwee3znpcytoh6jgknlof6 Rukjysosqdyan8wfbovpzrikmrpeywyu07ft Vvpsanuerxuhcghc7g6sd4pcyi9z-Wao8bn
20 pages
Multivariate Decision Trees: © 1995 Kluwer Academic Publishers, Boston. Manufactured in The Netherlands
No ratings yet
Multivariate Decision Trees: © 1995 Kluwer Academic Publishers, Boston. Manufactured in The Netherlands
33 pages
Lec4 - Decision Trees
No ratings yet
Lec4 - Decision Trees
43 pages
Unit 4
No ratings yet
Unit 4
33 pages
ML Mod2
No ratings yet
ML Mod2
5 pages
Decision Trees in Sklearn Decision Trees in Sklearn
No ratings yet
Decision Trees in Sklearn Decision Trees in Sklearn
7 pages
2023AIB1008_Lab08
No ratings yet
2023AIB1008_Lab08
8 pages
1630303435 ML TCS Lecture 1608 DecisionTree
No ratings yet
1630303435 ML TCS Lecture 1608 DecisionTree
41 pages
Decision Tree Pruning: Fundamentals and Applications
From Everand
Decision Tree Pruning: Fundamentals and Applications
Fouad Sabry
No ratings yet
How to Observe Software Systems
From Everand
How to Observe Software Systems
Gerald M. Weinberg
No ratings yet
Goldman Sachs - Internal Audit - Model Risk
No ratings yet
Goldman Sachs - Internal Audit - Model Risk
2 pages
Multi-Class Entropy: MMM NNN
No ratings yet
Multi-Class Entropy: MMM NNN
1 page
Scanned With Camscanner
No ratings yet
Scanned With Camscanner
3 pages
Proe Summary: A Powerful Exploratory Data Analysis Tool: Systems Seminar Consultants, Kalamazoo, MI
No ratings yet
Proe Summary: A Powerful Exploratory Data Analysis Tool: Systems Seminar Consultants, Kalamazoo, MI
10 pages
Freq PDF
No ratings yet
Freq PDF
207 pages
Mitu Mishra: Mobile: 07045726354 Preferred Location: Bangalore Address: Narsingarh, Dhalbhumgarh
No ratings yet
Mitu Mishra: Mobile: 07045726354 Preferred Location: Bangalore Address: Narsingarh, Dhalbhumgarh
5 pages
Request Number Requestor Previous Owner: TCS Confidential
No ratings yet
Request Number Requestor Previous Owner: TCS Confidential
3 pages
1D Arrays - 1
No ratings yet
1D Arrays - 1
27 pages
Choeu Phaneth - Builder Design Pattern
No ratings yet
Choeu Phaneth - Builder Design Pattern
4 pages
SIA-MDA and The Future
No ratings yet
SIA-MDA and The Future
2 pages
ARM-UNIT3 and UNIT4 Question Bank
No ratings yet
ARM-UNIT3 and UNIT4 Question Bank
3 pages
Execute Python Syntax
No ratings yet
Execute Python Syntax
8 pages
CSC 113 PAST QUESTIONS
No ratings yet
CSC 113 PAST QUESTIONS
8 pages
C-C++ Internship Tasks
No ratings yet
C-C++ Internship Tasks
16 pages
Introduction To R Programming
No ratings yet
Introduction To R Programming
23 pages
Training IDMS
80% (5)
Training IDMS
258 pages
Digital Design With SM Charts
No ratings yet
Digital Design With SM Charts
26 pages
Ama Mq4 Code
No ratings yet
Ama Mq4 Code
3 pages
Class XII (As Per CBSE Board) : Computer Science
No ratings yet
Class XII (As Per CBSE Board) : Computer Science
38 pages
What Is Preprocessor
No ratings yet
What Is Preprocessor
15 pages
Cs401 Grand Quiz
No ratings yet
Cs401 Grand Quiz
281 pages
Virtual Functions and Polymorphism
No ratings yet
Virtual Functions and Polymorphism
22 pages
Power BI & Data Analytics (1)tydhgfc
No ratings yet
Power BI & Data Analytics (1)tydhgfc
5 pages
Question Bank PPS (1)
No ratings yet
Question Bank PPS (1)
3 pages
Linear Search
No ratings yet
Linear Search
11 pages
Prolog Games
No ratings yet
Prolog Games
5 pages
70 483 Part2
No ratings yet
70 483 Part2
95 pages
Dgalab: An Extensible Software Implementation For Dga: Saleh I. Ibrahim, Sherif S.M. Ghoneim, Ibrahim B.M. Taha
No ratings yet
Dgalab: An Extensible Software Implementation For Dga: Saleh I. Ibrahim, Sherif S.M. Ghoneim, Ibrahim B.M. Taha
8 pages
C CCCCCCCCCCCCCCCC CC CCCCCCCCCCCCCCCCCCCCCCCCCCCCC C C C CCC C
No ratings yet
C CCCCCCCCCCCCCCCC CC CCCCCCCCCCCCCCCCCCCCCCCCCCCCC C C C CCC C
7 pages
Unit - III CNC Part Programming
No ratings yet
Unit - III CNC Part Programming
39 pages
Spos Endsem Model Answer Jan 2023
No ratings yet
Spos Endsem Model Answer Jan 2023
24 pages
unit 3
No ratings yet
unit 3
148 pages
Data Structures and Algorithm Analysis-Prelims
100% (1)
Data Structures and Algorithm Analysis-Prelims
4 pages
Example, Showing Entries in Different Databases: Relocatable
No ratings yet
Example, Showing Entries in Different Databases: Relocatable
15 pages
Stack Operations of Linear Array: Lab Report No. 5
No ratings yet
Stack Operations of Linear Array: Lab Report No. 5
3 pages
Introduction To PHP History of PHP
No ratings yet
Introduction To PHP History of PHP
14 pages

Hyperparameters Hyperparameters For Decision Trees: Maximum Depth

Uploaded by

Hyperparameters Hyperparameters For Decision Trees: Maximum Depth

Uploaded by

Back to Home

Maximum depth of a decision tree

Minimum number of samples to split

A node must have at least min_samples_split samples in order to be large enough to

Minimum number of samples per leaf

If a threshold on a feature results in a leaf that has fewer samples than

Overfitting Underfitting quiz

Small maximum depth

Large maximum depth

Small minimum samples per split

Large minimum samples per split

You might also like