ML Unit-3
Chapter-1
Tree models
Decision Trees are a non-parametric supervised learning method used for both classification
and regression tasks. Tree models where the target variable can take a discrete set of values
are called classification trees.
A decision tree is a tree-like graph with nodes representing the places where we pick an
attribute and ask a question; edges represent the answers to the question; and the leaves
represent the actual output or class label. Decision trees are used for non-linear decision
making with simple linear decision surfaces.
Let's illustrate this with the help of an example. Suppose we want to play badminton on a
particular day, say Saturday. How will you decide whether to play or not? Let's say you
go out and check whether it's hot or cold, check the speed of the wind and the humidity,
and check what the weather is like, i.e. whether it is sunny, cloudy, or rainy. You take all
these factors into account to decide whether you want to play or not.
So, you record all these factors for the last ten days and form a lookup table like the one
below.
Now, you may use this table to decide whether to play or not. But what if the weather pattern
on Saturday does not match any of the rows in the table? This could be a problem. A decision
tree is a great way to represent data like this because it takes into account all the
possible paths that can lead to the final decision by following a tree-like structure.
Fig 1. illustrates a learned decision tree. We can see that each node represents an attribute or
feature, and the branch from each node represents the outcome of that node. Finally, it is at
the leaves of the tree that the final decision is made. The basic procedure is:
1. Pick the best attribute/feature. The best attribute is one which best splits or separates
the data.
2. Ask the relevant question.
3. Follow the answer path.
4. Go to step 1 until you arrive at the answer.
The best split is one that separates examples with different labels into different subsets, making each subset as pure as possible.
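As a minimal sketch, a learned tree can be stored as nested dictionaries, and following the answer path is then a simple traversal. The attribute names and values below are hypothetical, echoing the badminton example:

    tree = {
        "weather": {
            "sunny": {"humidity": {"high": "don't play", "normal": "play"}},
            "cloudy": "play",
            "rainy": {"windy": {"yes": "don't play", "no": "play"}},
        }
    }

    def predict(tree, example):
        """Follow the answer path from the root until a leaf (class label)."""
        if not isinstance(tree, dict):       # reached a leaf: the final decision
            return tree
        attribute = next(iter(tree))         # the attribute tested at this node
        answer = example[attribute]          # the answer to the node's question
        return predict(tree[attribute][answer], example)

    print(predict(tree, {"weather": "sunny", "humidity": "normal"}))  # -> "play"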
Decision trees can represent any boolean function of the input attributes. Let’s use decision
trees to perform the function of three boolean gates AND, OR and XOR.
Fig 3. Decision tree for an AND operation.
In Fig 3., we can see that there are two candidate concepts for producing the decision tree that
performs the AND operation.
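As a small sketch, the AND tree can be written as nested tests: the root tests one input, and the second input only needs to be tested when the first is true (testing b at the root instead gives the other candidate). The function below is illustrative:

    def and_tree(a, b):
        """Decision tree for AND: test a at the root, b at the descendant node."""
        if a:               # root node: does a hold?
            if b:           # descendant node: does b hold?
                return True     # leaf: both inputs are true
            return False        # leaf: a is true but b is false
        return False            # leaf: a is false, b need not be tested

    # Sanity check against the truth table of AND.
    for a in (False, True):
        for b in (False, True):
            assert and_tree(a, b) == (a and b)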
The basic algorithm used in decision trees is known as the ID3 algorithm (by Quinlan). The
ID3 algorithm builds decision trees using a top-down, greedy approach. Briefly, the steps of
the algorithm are:
1. Select the best attribute, A.
2. Assign A as the decision attribute (test case) for the NODE.
3. For each value of A, create a new descendant of the NODE.
4. Sort the training examples to the appropriate descendant node leaf.
5. If the examples are perfectly classified, then STOP; else iterate over the new leaf nodes.
Pseudocode: ID3 is a greedy algorithm that grows the tree top-down, at each node selecting
the attribute that best classifies the local training examples. This process continues until the
tree perfectly classifies the training examples or until all attributes have been used.
The pseudocode assumes that the attributes are discrete and that the classification is binary.
Examples is the set of training examples. Target_attribute is the attribute whose value is to be
predicted by the tree. Attributes is a list of other attributes that may be tested by the learned
decision tree. The algorithm returns a decision tree that correctly classifies the given Examples.
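A minimal Python sketch of this pseudocode is given below. It uses entropy and information gain to select the best attribute; the function and variable names are illustrative, and examples are assumed to be dictionaries mapping attribute names to discrete values:

    import math
    from collections import Counter

    def entropy(examples, target):
        """Entropy of the target labels over a set of examples."""
        counts = Counter(ex[target] for ex in examples)
        n = len(examples)
        return -sum((c / n) * math.log2(c / n) for c in counts.values())

    def information_gain(examples, attribute, target):
        """Reduction in entropy from splitting the examples on attribute."""
        n = len(examples)
        remainder = 0.0
        for value in set(ex[attribute] for ex in examples):
            subset = [ex for ex in examples if ex[attribute] == value]
            remainder += (len(subset) / n) * entropy(subset, target)
        return entropy(examples, target) - remainder

    def id3(examples, target, attributes):
        """Grow the tree top-down, greedily picking the best attribute per node."""
        labels = [ex[target] for ex in examples]
        if len(set(labels)) == 1:             # perfectly classified: STOP
            return labels[0]
        if not attributes:                    # no attributes left: majority label
            return Counter(labels).most_common(1)[0][0]
        best = max(attributes, key=lambda a: information_gain(examples, a, target))
        node = {best: {}}
        for value in set(ex[best] for ex in examples):
            subset = [ex for ex in examples if ex[best] == value]
            node[best][value] = id3(subset, target, [a for a in attributes if a != best])
        return node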
For regression trees, where the target is continuous, splits are chosen by variance reduction instead of information gain.
Steps to calculate Variance:
1. Calculate the variance of each child node.
2. Calculate the variance for each split as the weighted average of each node's variance.
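A minimal sketch of these two steps, on illustrative data:

    def variance(values):
        """Step 1: variance of the target values within one node."""
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values) / len(values)

    def split_variance(children):
        """Step 2: weighted average of the child-node variances."""
        total = sum(len(child) for child in children)
        return sum((len(child) / total) * variance(child) for child in children)

    # The split with the lower weighted variance separates the targets better.
    print(split_variance([[10, 12, 11], [30, 29, 31]]))  # low: a good split
    print(split_variance([[10, 30, 11], [12, 29, 31]]))  # high: a poor split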
Rule models
Learning Rules
In learning rules, we are interested in learning rules of the form:
if A1 ^ A2 ^ . . . then C
where A1, A2, . . . are the preconditions/constraints/body/antecedents of the rule and C is the
postcondition/head/consequent of the rule.
A first-order rule can contain variables, as in:
if Parent(x, z) ^ Ancestor(z, y) then Ancestor(x, y)
A Horn clause is a rule whose body is a conjunction of positive (unnegated) literals and whose head is a single literal.
For example, a set of rules learned from the weather data might be:
if windy ^ rain then bad
if ¬windy ^ hot ^ sunny then bad
if ¬windy ^ hot ^ overcast then good
if ¬windy ^ mild ^ sunny then bad
if ¬windy ^ mild ^ rain then good
if ¬windy ^ cool then good
Rule Ordering
Order rules by accuracy and coverage. In the ordered list below, the number before each rule is the count of examples it covers:
4 if overcast then good
3 if ¬windy ^ rain then good
3 if sunny ^ high then bad
2 if sunny ^ normal then good
2 if ¬windy ^ cool then good
2 if windy ^ rain then bad
2 if hot ^ sunny then bad
1 if ¬windy ^ mild ^ sunny then bad
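As a sketch, an ordered rule list like this can be applied as a decision list: try the rules in order and return the consequent of the first rule whose preconditions all hold. The encoding below is illustrative:

    # Each rule: (coverage, preconditions, consequent); preconditions are
    # attribute/value pairs that must all hold for the rule to fire.
    rules = [
        (4, {"outlook": "overcast"}, "good"),
        (3, {"windy": False, "outlook": "rain"}, "good"),
        (3, {"outlook": "sunny", "humidity": "high"}, "bad"),
    ]

    def classify(example, rules):
        """Return the consequent of the first rule that fires, in order."""
        for _, preconditions, consequent in rules:
            if all(example.get(k) == v for k, v in preconditions.items()):
                return consequent
        return None   # no rule covers this example

    print(classify({"outlook": "sunny", "humidity": "high", "windy": True}, rules))
    # -> "bad"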
Sequential Covering Algorithms
Basic Algorithmic Idea:
1. Learn one good rule.
2. Remove the examples covered by the rule.
3. Repeat until no examples are left.
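A minimal sketch of this loop, assuming hypothetical learn_one_rule and covers helpers (neither is defined in these notes):

    def sequential_covering(examples, learn_one_rule, covers):
        """Learn a rule, remove the examples it covers, repeat until none remain."""
        rules = []
        while examples:
            rule = learn_one_rule(examples)   # 1. learn one good rule
            if rule is None:                  # no acceptable rule found: stop early
                break
            rules.append(rule)
            # 2. remove the examples covered by the rule
            examples = [ex for ex in examples if not covers(rule, ex)]
        return rules                          # 3. the while-loop handles the repeat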
Learning First-Order Rules
Suppose we are interested in learning concepts involving relationships between objects.
• When one person is an ancestor of another person.
• When one number is less than another number.
• When one node can reach another node in a graph.
• When an element is in a set.
These concepts involve intermediate objects and relations.
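As an illustration, the first-order Ancestor rule from above can be executed directly over a set of Parent facts. The facts below are hypothetical, and the recursion assumes they contain no cycles:

    # Hypothetical Parent facts as (parent, child) pairs.
    parents = {("ann", "bob"), ("bob", "carol"), ("carol", "dave")}

    def ancestor(x, y):
        """Ancestor(x, y) if Parent(x, y), or Parent(x, z) and Ancestor(z, y)."""
        if (x, y) in parents:
            return True
        return any(ancestor(z, y) for (p, z) in parents if p == x)

    print(ancestor("ann", "dave"))  # True, via the intermediate objects bob and carol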