Department of Electronics & Telecommunications Engineering: ETEL71A-Machine Learning and AI

The document describes implementing the ID3 decision tree algorithm to classify a dataset on social media purchases, including preprocessing the data, fitting a decision tree classifier to the training data, using the trained model to predict test data labels, and evaluating the model's accuracy. Code is provided to split data, scale features, train a decision tree, predict test labels, and calculate accuracy and a confusion matrix. The results show high classification accuracy from the decision tree model on this dataset.

Uploaded by

Shrey Dixit

Department of Electronics & Telecommunications Engineering
ETEL71A-Machine Learning and AI
Class: BE
Name : Adya Kastwar
UID: 2016120024
Sem: VII
Experiment: Decision Tree (ID3) algorithm

Objective: Write a Python program to demonstrate the working of the decision-tree-based ID3
algorithm, using an appropriate data set to build the decision tree, and apply this
knowledge to classify a new sample.
Outcomes:
1. Find entropy of data and follow steps of the algorithm to construct a tree.
2. Representation of hypothesis using decision tree.
3. Apply Decision Tree algorithm to classify the given data.
4. Interpret the output of Decision Tree.

System Requirements:
Linux OS with Python and libraries, or R, or Windows with MATLAB

Task 1: Describe the ID3 algorithm:

a. Note down the different decision tree algorithms and understand the steps of the ID3 algorithm.
b. Solve the algorithm and form a hypothesis in a notebook for the following ‘Family Dogs
characteristics’ training set. Verify the ‘Characteristic’ for the testing set and say whether the
target concept has been learnt successfully.
Decision trees are supervised learning algorithms used for both classification and regression tasks.
They belong to the family of information-based learning algorithms, which use different
measures of information gain for learning.
The main idea of decision trees is to find the descriptive features that carry the most
"information" about the target feature and then split the dataset along the values of these
features so that the target feature values in the resulting sub-datasets are as pure as possible. The
descriptive feature that leaves the target feature purest is said to be the most informative
one. This search for the "most informative" feature is repeated until a stopping
criterion is met, at which point we end up in so-called leaf nodes.
Assumptions we make while using a decision tree:
• At the beginning, the whole training set is considered as the root.
• Attributes are assumed to be categorical for information gain; for the Gini index, attributes
are assumed to be continuous.
• Records are distributed recursively on the basis of attribute values.
• Statistical methods are used to order attributes as the root or as internal nodes.
Pseudocode:
1. Find the best attribute and place it at the root node of the tree.
2. Split the training set into subsets such that every record in a subset has the
same value for that attribute.
3. Repeat steps 1 and 2 on each subset until leaf nodes are found in all branches.
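The three steps above can be sketched in a few lines of Python. This is only a minimal illustration, assuming categorical attributes stored as dictionaries and information gain as the measure of the "best" attribute. The function names (id3, best_attribute, classify) and the tiny weather-style data set are hypothetical and are not part of the experiment's data.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())


def best_attribute(rows, labels, attributes):
    """Step 1: pick the attribute with the highest information gain."""
    def gain(attr):
        groups = {}
        for row, label in zip(rows, labels):
            groups.setdefault(row[attr], []).append(label)
        remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
        return entropy(labels) - remainder
    return max(attributes, key=gain)


def id3(rows, labels, attributes):
    """Build a tree as nested dicts: {attribute: {value: subtree_or_label}}."""
    if len(set(labels)) == 1:          # pure subset: this is a leaf
        return labels[0]
    if not attributes:                 # nothing left to split on: majority vote
        return Counter(labels).most_common(1)[0][0]
    attr = best_attribute(rows, labels, attributes)
    remaining = [a for a in attributes if a != attr]
    tree = {attr: {}}
    # Step 2: one subset per value of the chosen attribute.
    for value in sorted({row[attr] for row in rows}):
        pairs = [(r, l) for r, l in zip(rows, labels) if r[attr] == value]
        sub_rows = [r for r, _ in pairs]
        sub_labels = [l for _, l in pairs]
        # Step 3: repeat on each subset.
        tree[attr][value] = id3(sub_rows, sub_labels, remaining)
    return tree


def classify(tree, sample, default=None):
    """Walk the nested dicts until a leaf label is reached."""
    while isinstance(tree, dict):
        attr = next(iter(tree))
        tree = tree[attr].get(sample[attr], default)
    return tree


# Hypothetical toy data, used only to exercise the sketch.
toy_rows = [
    {"outlook": "sunny", "windy": "no"},
    {"outlook": "sunny", "windy": "yes"},
    {"outlook": "rainy", "windy": "no"},
    {"outlook": "rainy", "windy": "yes"},
]
toy_labels = ["play", "play", "stay", "stay"]
toy_tree = id3(toy_rows, toy_labels, ["outlook", "windy"])
```

On this toy data the target depends only on "outlook", so the sketch places that attribute at the root and both branches become leaves immediately.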
Entropy is a measure of the uncertainty of a random variable; it characterizes the impurity of an
arbitrary collection of examples. The higher the entropy, the higher the information content.
• The entropy typically changes when a node in a decision tree partitions the training
instances into smaller subsets. Information gain is a measure of this change in entropy.
• Sklearn supports the "entropy" criterion for information gain, but it is not the default
(the default is "gini"), so it has to be specified explicitly with criterion='entropy'.
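The two quantities described above can be written out directly. The sketch below assumes the usual definitions: entropy H(S) = -sum over classes of p_i * log2(p_i), and information gain = H(S) minus the size-weighted entropy of the subsets produced by a split. The helper names and the four-row example are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """H(S) = -sum(p_i * log2(p_i)) over the classes present in labels."""
    total = len(labels)
    return -sum((c / total) * math.log2(c / total) for c in Counter(labels).values())


def information_gain(rows, labels, attribute):
    """Entropy before the split minus the weighted entropy after it."""
    total = len(labels)
    subsets = {}
    for row, label in zip(rows, labels):
        subsets.setdefault(row[attribute], []).append(label)
    remainder = sum(len(s) / total * entropy(s) for s in subsets.values())
    return entropy(labels) - remainder


# Hypothetical example: "size" separates the classes perfectly, "colour" not at all.
demo_rows = [
    {"size": "small", "colour": "red"},
    {"size": "small", "colour": "blue"},
    {"size": "large", "colour": "red"},
    {"size": "large", "colour": "blue"},
]
demo_labels = ["yes", "yes", "no", "no"]

gain_size = information_gain(demo_rows, demo_labels, "size")
gain_colour = information_gain(demo_rows, demo_labels, "colour")
```

With two equally frequent classes the starting entropy is 1 bit. Splitting on "size" yields two pure subsets, so the gain is the full 1 bit, while splitting on "colour" leaves both subsets evenly mixed and the gain is 0.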

Task 2: Implement the algorithm in Python/R/MATLAB to classify the dataset.

Dataset: online database on purchases made based on social media ads

Code:
# Decision Tree Classification
# Adya Kastwar

# Importing libraries
import pandas as pd
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

# Importing the dataset
dataset = pd.read_csv('Social_Network_Ads.csv')

# Creating separate matrices for the features (columns 2 and 3) and the output (column 4).
# Only the first 80 examples are used in this run.
X = dataset.iloc[:80, [2, 3]].values
y = dataset.iloc[:80, 4].values

# Splitting the dataset into the training set (75%) and the test set (25%)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)

# Feature scaling brings the attributes to the same scale. The scaler is fitted on the
# training set only and then applied to the test set. (Decision trees split on thresholds,
# so scaling is not strictly required here, but it does no harm.)
sc = StandardScaler()
X_train = sc.fit_transform(X_train)
X_test = sc.transform(X_test)

# Fitting the decision tree classifier to the training set, using entropy as the split criterion
classifier = DecisionTreeClassifier(criterion='entropy', random_state=0)
classifier.fit(X_train, y_train)

# Using the trained model to predict the test set results
y_pred = classifier.predict(X_test)

# Results: confusion matrix (rows = true labels, columns = predicted labels) and accuracy
cm = confusion_matrix(y_test, y_pred)
print("The confusion matrix is:\n", cm)
print("The prediction accuracy is:", classifier.score(X_test, y_test) * 100, "%")
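The accuracy printed by the program is directly related to the confusion matrix: it is the sum of the diagonal entries (correct predictions) divided by the total number of test examples. The sketch below shows this relationship in plain Python. The helper names and the six demo labels are hypothetical; they are not results from the experiment.

```python
def confusion_counts(y_true, y_pred, labels=(0, 1)):
    """Count matrix with rows = true labels and columns = predicted labels."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for truth, guess in zip(y_true, y_pred):
        matrix[index[truth]][index[guess]] += 1
    return matrix


def accuracy_from_matrix(matrix):
    """Correct predictions (the diagonal) divided by all predictions."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


# Hypothetical labels, only to illustrate the calculation.
y_true_demo = [0, 0, 1, 1, 1, 0]
y_pred_demo = [0, 1, 1, 1, 0, 0]

cm_demo = confusion_counts(y_true_demo, y_pred_demo)
acc_demo = accuracy_from_matrix(cm_demo)
```

Here four of the six demo predictions are correct, so the diagonal sums to 4 and the accuracy is 4/6.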

Output :
Conclusion:
• High classification accuracy was obtained using a decision tree based on entropy.
• As the number of examples increased from 80 to 400, the prediction accuracy increased
greatly.
