Lecture #5
Decision Support
Decision Support
• One of the earliest AI problems was decision support
• The first solution to this problem was expert systems
• They used an often very large number of hand-crafted if-then rules
• These problems are suitable for a type of algorithm called Decision Trees
• The dataset typically contains mostly categorical features, but can have numerical features as well
Decision Trees
• Decision Trees have one big advantage: the trained model is easy to visualize and interpret
• We can understand what the algorithm has learned
• This can be important in applications where we want to investigate why the system made a decision
• This is commonly referred to as a completely transparent method
Example: Weather dataset
Outlook   Temperature  Humidity  Windy  Play
sunny     hot          high      false  NO
sunny     hot          high      true   NO
overcast  hot          high      false  YES
rainy     mild         high      false  YES
rainy     cool         normal    false  YES
rainy     cool         normal    true   NO
overcast  cool         normal    true   YES
sunny     mild         high      false  NO
sunny     cool         normal    false  YES
rainy     mild         normal    false  YES
sunny     mild         normal    true   YES
overcast  mild         high      true   YES
overcast  hot          normal    false  YES
rainy     mild         high      true   NO
Building the tree
• At each node, we need to find the attribute that best divides the data into Yes and No
• To do this, we calculate the information gain for each attribute (and candidate split value), as sketched below
• The attribute with the highest information gain is selected at each node
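• As an illustration, here is a minimal R sketch (not from the lecture) of an entropy-based information gain computation, using the class counts shown in the tables on the following slides. The exact numbers depend on which gain variant is used, so they may not match the values reported on the slides exactly.

# Entropy of a vector of class counts
entropy <- function(counts) {
  p <- counts / sum(counts)
  p <- p[p > 0]                 # skip empty classes to avoid log2(0)
  -sum(p * log2(p))
}

# Information gain of a split: parent counts vs. a list of per-branch counts
info_gain <- function(parent, branches) {
  n <- sum(parent)
  remainder <- sum(sapply(branches, function(b) sum(b) / n * entropy(b)))
  entropy(parent) - remainder
}

# Example: gain for Outlook at the root, using the counts shown below
# (9 Yes / 5 No overall; sunny 2/3, overcast 4/0, rainy 3/2)
info_gain(c(9, 5), list(sunny = c(2, 3), overcast = c(4, 0), rainy = c(3, 2)))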
Find the root node
Outlook  Sunny  Overcast  Rainy
Yes      2      4         3
No       3      0         2
Find the root node
Temperature  Hot  Mild  Cool
Yes          2    4     3
No           2    2     1
Find the root node
Humidity  High  Normal
Yes       3     6
No        4     1
Find the root node
Windy  True  False
Yes    3     6
No     3     2
Find the root node
Attribute    Gain
Outlook      0.306
Temperature  0.088
Humidity     0.211
Windy        0.107
Outlook has the highest gain and is selected as the root node
Find the root node
outlook
  sunny    -> ?
  overcast -> yes
  rainy    -> ?

Overcast has perfect gain = all of its examples belong to the same category: Yes
Let’s find the sunny node!
All examples with sunny
Outlook  Temperature  Humidity  Windy  Play
sunny    hot          high      false  NO
sunny    hot          high      true   NO
sunny    mild         high      false  NO
sunny    cool         normal    false  YES
sunny    mild         normal    true   YES
• Now we use a subset of the data
• It contains all examples with Outlook = sunny
• 5 examples
Find the sunny node
Temperature  Hot  Mild  Cool
Yes          0    1     1
No           2    1     0
Find the sunny node
Humidity  High  Normal
Yes       0     2
No        3     0
Find the sunny node
Windy  True  False
Yes    1     1
No     1     2
Find the sunny node
outlook
  sunny    -> humidity
                high   -> no
                normal -> yes
  overcast -> yes
  rainy    -> ?

Since humidity has perfect gain, it is selected
Let’s find the rainy node!
All examples with rainy
Outlook  Temperature  Humidity  Windy  Play
rainy    mild         high      false  YES
rainy    cool         normal    false  YES
rainy    cool         normal    true   NO
rainy    mild         normal    false  YES
rainy    mild         high      true   NO
• Again, we use a subset of the data
• It contains all examples with Outlook = rainy
• 5 examples
Find the rainy node
Temperature  Hot  Mild  Cool
Yes          0    2     1
No           0    1     1
Find the rainy node
Humidity  High  Normal
Yes       1     2
No        1     1
Find the rainy node
Windy  True  False
Yes    0     3
No     2     0
Since windy has perfect gain,
it is selected
Final tree
outlook
  sunny    -> humidity
                high   -> no
                normal -> yes
  overcast -> yes
  rainy    -> windy
                false -> yes
                true  -> no
The problem
• In most cases, there are several possible trees that can be generated
• The aim is to:
1. Generate a tree that classifies the training data as accurately as possible
2. Generate the smallest possible tree
• It can be tricky to satisfy both
• The first has the highest priority
Generating a good tree
• There is a wide range of algorithms for generating decision trees
• Each tries to fulfill both criteria as much as possible
• Weka uses an algorithm called J48 (an implementation of C4.5)
Classification
• To classify an example, we traverse the tree by following the branches that match the example's attribute values
• When we reach a leaf node, the result (category) is returned (a small sketch of this is shown below)
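• As a small sketch (not part of the lecture material), the final tree from the earlier slide can be written as nested if/else tests in R, which is exactly the traversal described above:

classify_weather <- function(outlook, humidity, windy) {
  if (outlook == "overcast") {
    "yes"                                     # overcast leaf
  } else if (outlook == "sunny") {
    if (humidity == "high") "no" else "yes"   # humidity sub-tree
  } else {                                    # outlook == "rainy"
    if (windy) "no" else "yes"                # windy sub-tree
  }
}

classify_weather("sunny", "normal", FALSE)    # reaches the leaf "yes"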
Overfitting
• Decision Trees can suffer from overfitting
• This means that the learned model is very specific to the training data, but may be bad at classifying unseen examples
• To get around this problem, learning is usually stopped before there is a risk of overfitting
Overfitting
• A common approach to reduce overfitting in Decision Trees is to stop creating more branches if a split only gives a very small increase in gain
• We can set a minimum threshold for how large the gain must be to allow a new branch to be created (see the sketch below)
• There is no universal answer to which threshold to use
• You have to experiment on the dataset you use
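• A hedged sketch of how such a threshold can be set with the rpart package (which caret's "rpart" method wraps). The data frame name weather is an assumption and stands for the weather dataset above; cp requires a minimum improvement for each split, while minsplit/minbucket stop branching on very small nodes. Suitable values have to be found by experimenting.

library(rpart)

# Assumed: 'weather' is a data frame holding the weather dataset,
# including the class column Play.
fit <- rpart(Play ~ ., data = weather, method = "class",
             control = rpart.control(cp = 0.05,      # minimum required improvement per split
                                     minsplit = 5,   # fewest examples needed to try a split
                                     minbucket = 2)) # fewest examples allowed in a leaf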
When to use Decision Trees
• As mentioned, one big advantage of DTs is that we can interpret the trained model
• There are some other benefits of DTs
• They work on both numerical and nominal attributes without pre-processing the data, which many other algorithms cannot
• They also support probabilistic reasoning about assignments, as when we return the most probable category (see the sketch below)
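• For example, the rpart model sketched in the overfitting section can return the class distribution at the reached leaf instead of a single label (again assuming the weather data frame):

predict(fit, newdata = weather[1, ], type = "prob")   # class probabilities at the leaf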
When to use Decision Trees
• The major drawback is that DTs are not very good for complex learning problems
• If we have lots of categories, the decision tree tends to become very complicated and will most likely make poor predictions
• Another disadvantage is that they can only make simple greater-than/less-than decisions for numerical attributes
• They work best if we have a combination of numerical and nominal data and few categories (which many real-world problems satisfy)
Weka
• Weka’s standard Decision Tree classifier is called
J48.
• When using J48 on the Weather dataset we get the
following result:
R
• In R, we can use an algorithm called CART
• The dataset needs to be in CSV format
• The R script looks like this:
R script
# Load the ML library
library(caret)

# Read the dataset
dataset <- read.csv("FIFA_skill.csv")

# Set up 10-fold cross-validation
control <- trainControl(method = "cv", number = 10)
metric <- "Accuracy"

# Train the model using CART
set.seed(7)
cart <- train(PlayerSkill ~ ., data = dataset, method = "rpart",
              metric = metric, trControl = control)

# Print the result
print(cart)
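• Once trained, the caret model can also be used to classify new examples with predict(); here the first rows of the same data frame are re-used purely for illustration:

predict(cart, newdata = head(dataset))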
R result
Warning message:
In nominalTrainWorkflow(x = x, y = y, wts = weights, info = trainInfo, :
  There were missing values in resampled performance measures.

# Print result
> print(cart)
CART

19 samples
 3 predictor
 2 classes: 'bad', 'good'

No pre-processing
Resampling: Cross-Validated (10 fold)
Summary of sample sizes: 17, 17, 17, 17, 17, 17, ...
Resampling results:

  Accuracy  Kappa
  0.55      0
R result
• The warning message from R means that the 10-fold CV split the dataset so that one class was missing in some folds
• This has a large impact on the result
• R needs more data to model the dataset accurately
• If we make a copy of each example in the dataset (twice as much data), the result is:
R result
CART
38 samples
3 predictor
2 classes: 'bad', 'good'
No pre-processing
Resampling: Cross-Validated (10 fold)
Summary of sample sizes: 35, 34, 34, 34, 34, 34, ...
Resampling results across tuning parameters:
  cp         Accuracy   Kappa
  0.0000000  0.8416667  0.69
  0.3333333  0.8416667  0.69
  0.6666667  0.6083333  0.19
Accuracy was used to select the optimal model using the largest value.
The final value used for the model was cp = 0.3333333.