Decision Tree

The document outlines a Python implementation of a Decision Tree classifier using the scikit-learn library to predict whether to play tennis based on weather conditions. It includes data preprocessing steps, model training, evaluation metrics such as accuracy and confusion matrix, and visualizes the decision tree. The model achieved an accuracy of 1.0 on the test set.

Uploaded by

angelin272004


DECISION TREE

# Import necessary libraries

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier, plot_tree
from sklearn.metrics import confusion_matrix, accuracy_score, classification_report
import matplotlib.pyplot as plt

# Load the dataset

data = {
'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast', 'Sunny', 'Sunny',
'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool', 'Mild', 'Cool', 'Mild', 'Mild',
'Mild', 'Hot', 'Mild'],
'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal',
'Normal', 'Normal', 'High', 'Normal', 'High'],
'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Weak', 'Weak',
'Strong', 'Strong', 'Weak', 'Strong'],
'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes', 'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No']
}

# Convert the dictionary to a DataFrame

df = pd.DataFrame(data)

# Convert categorical variables to numerical using one-hot encoding

df = pd.get_dummies(df, columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])

# Separate features and target variable

X = df.drop('PlayTennis', axis=1)
y = df['PlayTennis']
# Split the data into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Initialize Decision Tree classifier

decision_tree = DecisionTreeClassifier()

# Train the model

decision_tree.fit(X_train, y_train)

# Make predictions on the testing set

y_pred = decision_tree.predict(X_test)

# Calculate accuracy
accuracy = accuracy_score(y_test, y_pred)
print("Accuracy:", accuracy)

# Print confusion matrix

print("\nConfusion Matrix:")
print(confusion_matrix(y_test, y_pred))

# Print classification report

print("\nClassification Report:")
print(classification_report(y_test, y_pred))

# Convert feature names Index to a list

feature_names = X.columns.tolist()

# Plot the decision tree

plt.figure(figsize=(12, 8))
plot_tree(decision_tree, feature_names=feature_names, class_names=['No', 'Yes'], filled=True)
plt.show()
Output
Accuracy: 1.0

Confusion Matrix:
[[1 0]
 [0 2]]

Classification Report:
              precision    recall  f1-score   support

          No       1.00      1.00      1.00         1
         Yes       1.00      1.00      1.00         2

    accuracy                           1.00         3
   macro avg       1.00      1.00      1.00         3
weighted avg       1.00      1.00      1.00         3

Step by Step Explanation
1. Import Necessary Libraries:

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.tree import DecisionTreeClassifier, plot_tree

from sklearn.metrics import confusion_matrix, accuracy_score, classification_report

import matplotlib.pyplot as plt

Explanation:

• pandas: Library for data manipulation and analysis.
• train_test_split: Function to split the dataset into training and testing sets.
• DecisionTreeClassifier: Class for the decision tree classification model.
• plot_tree: Function to visualize the decision tree.
• confusion_matrix, accuracy_score, classification_report: Functions to evaluate the model's performance.
• matplotlib.pyplot: Library for plotting graphs.

2. Load the Dataset:

data = {

'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast', 'Sunny', 'Sunny',
'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],

'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool', 'Mild', 'Cool', 'Mild', 'Mild',
'Mild', 'Hot', 'Mild'],

'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal',
'Normal', 'Normal', 'High', 'Normal', 'High'],

'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Weak', 'Weak',
'Strong', 'Strong', 'Weak', 'Strong'],

'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes', 'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No']

}

df = pd.DataFrame(data)

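Explanation:

• We define a dictionary containing the "Play Tennis" dataset: 14 observations with four categorical features (Outlook, Temperature, Humidity, Wind) and the target PlayTennis.

• We then convert this dictionary to a pandas DataFrame.
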
3. Data Preprocessing:

df = pd.get_dummies(df, columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])

X = df.drop('PlayTennis', axis=1)

y = df['PlayTennis']
Explanation:

• We use one-hot encoding (pd.get_dummies) to convert the four categorical feature columns into numeric indicator columns.

• X contains the encoded features, and y contains the target variable PlayTennis.
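
To make the effect of one-hot encoding concrete, here is a minimal sketch on a three-row toy frame. The `toy` frame and its values are illustrative assumptions, not the full dataset. Each category of Outlook becomes its own indicator column, while PlayTennis passes through unchanged.

```python
import pandas as pd

# Hypothetical three-row frame used only to illustrate the encoding.
toy = pd.DataFrame({
    'Outlook': ['Sunny', 'Overcast', 'Rainy'],
    'PlayTennis': ['No', 'Yes', 'Yes'],
})

# Only the listed columns are encoded; other columns are left as they are.
encoded = pd.get_dummies(toy, columns=['Outlook'])

print(encoded.columns.tolist())
print(encoded)
```

The new columns are named by joining the original column name and the category with an underscore, for example Outlook_Sunny.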

4. Split Data into Training and Testing Sets:

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

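Explanation:

• We split the dataset into training and testing sets using the train_test_split function.
• With test_size=0.2, about 80% of the data is used for training and 20% for testing. For the 14 rows here that means 11 training rows and 3 test rows, which matches the support of 3 in the classification report.
• random_state=42 fixes the shuffle, so the same split is produced on every run.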
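
The split sizes can be checked with a short sketch. The list `rows` is a stand-in (an assumption for illustration) for the 14 rows of the dataset; only its length matters here.

```python
from sklearn.model_selection import train_test_split

# Stand-in for the 14 dataset rows; the values themselves are irrelevant here.
rows = list(range(14))

# Same arguments as in the document: 20% test size and a fixed seed.
train_rows, test_rows = train_test_split(rows, test_size=0.2, random_state=42)

print(len(train_rows), len(test_rows))  # prints: 11 3
```

The test size is rounded up (0.2 × 14 = 2.8 becomes 3), which is why the evaluation in this document is based on three rows.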

5. Initialize and Train Decision Tree Model:

decision_tree = DecisionTreeClassifier()

decision_tree.fit(X_train, y_train)

Explanation:

• We initialize a DecisionTreeClassifier object with its default settings.

• Then we train the model on the training data with the fit method.
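
The classifier above is created without a random_state. scikit-learn permutes the candidate features at each split, so ties between equally good splits can be broken differently from run to run. The sketch below shows one way to make the tree reproducible, under a few assumptions: the seed value 0 is arbitrary, and for brevity the tree is fitted on all 14 rows rather than on the training split.

```python
import numpy as np
import pandas as pd
from sklearn.tree import DecisionTreeClassifier

data = {
    'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast',
                'Sunny', 'Sunny', 'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
    'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool',
                    'Mild', 'Cool', 'Mild', 'Mild', 'Mild', 'Hot', 'Mild'],
    'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal',
                 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal', 'High'],
    'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong',
             'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Strong'],
    'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes',
                   'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No'],
}
df = pd.get_dummies(pd.DataFrame(data),
                    columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])
X = df.drop('PlayTennis', axis=1)
y = df['PlayTennis']

# random_state=0 is an arbitrary illustrative seed; any fixed integer works.
tree_a = DecisionTreeClassifier(random_state=0).fit(X, y)
tree_b = DecisionTreeClassifier(random_state=0).fit(X, y)

# With the same seed, both trees pick the same feature at every node.
same_structure = np.array_equal(tree_a.tree_.feature, tree_b.tree_.feature)
print("Identical structure:", same_structure)
print("Depth:", tree_a.get_depth(), "Leaves:", tree_a.get_n_leaves())
```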

6. Make Predictions and Evaluate Model:

y_pred = decision_tree.predict(X_test)

accuracy = accuracy_score(y_test, y_pred)

conf_matrix = confusion_matrix(y_test, y_pred)

class_report = classification_report(y_test, y_pred)

Explanation:

• We make predictions on the testing data using the predict method.
• Then we calculate the accuracy using accuracy_score.
• We also compute the confusion matrix and the classification report.
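
To show how the confusion matrix is read, here is a small sketch with hypothetical labels. `y_true` and `y_hat` are made up for illustration and include one deliberate mistake; they are not the predictions from this document. Passing `labels` fixes the order of rows and columns.

```python
from sklearn.metrics import accuracy_score, confusion_matrix

# Hypothetical labels for three test rows, with one deliberate mistake.
y_true = ['No', 'Yes', 'Yes']
y_hat = ['No', 'Yes', 'No']

# Rows are the true classes and columns the predicted classes,
# both in the order given by labels.
cm = confusion_matrix(y_true, y_hat, labels=['No', 'Yes'])
acc = accuracy_score(y_true, y_hat)

print(cm)
print(round(acc, 2))
```

Here the second row reads [1 1]: of the two rows that are truly 'Yes', one was predicted 'No' and one 'Yes', giving an accuracy of 2/3.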

7. Print Model Evaluation Metrics:

print("Accuracy:", accuracy)

print("\nConfusion Matrix:")

print(conf_matrix)

print("\nClassification Report:")

print(class_report)
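Explanation:

• We print the accuracy, confusion matrix, and classification report to evaluate the model's performance. Because the test set contains only three rows, accuracy can only take the values 0, 1/3, 2/3, or 1, so a score of 1.0 here is weak evidence of how well the model generalizes.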
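
With so few rows, a single train/test split gives a noisy estimate. One common alternative, shown here as a sketch and not as part of the original program, is k-fold cross-validation with cross_val_score. The choice of cv=5 and random_state=0 is an assumption made for illustration.

```python
import pandas as pd
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

data = {
    'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast',
                'Sunny', 'Sunny', 'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
    'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool',
                    'Mild', 'Cool', 'Mild', 'Mild', 'Mild', 'Hot', 'Mild'],
    'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal',
                 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal', 'High'],
    'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong',
             'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Strong'],
    'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes',
                   'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No'],
}
df = pd.get_dummies(pd.DataFrame(data),
                    columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])
X = df.drop('PlayTennis', axis=1)
y = df['PlayTennis']

# cv=5 uses stratified folds for classifiers, so each fold keeps
# roughly the same Yes/No ratio as the full dataset.
scores = cross_val_score(DecisionTreeClassifier(random_state=0), X, y, cv=5)

print("Per-fold accuracy:", scores)
print("Mean accuracy:", scores.mean())
```

Every row is used for testing exactly once, so the mean accuracy is based on all 14 rows and not only on three.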

8. Plot the Decision Tree:

plt.figure(figsize=(12, 8))

plot_tree(decision_tree, feature_names=X.columns.tolist(), class_names=['No', 'Yes'], filled=True)

plt.show()

Explanation:

• Finally, we plot the decision tree using the plot_tree function to visualize the model's decision-making process.
• We pass the feature names (as a list) and the class names so that each node in the plot is labeled with the feature it tests and the class it predicts.
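
When a plot window is not available, the same tree can be printed as indented text rules with export_text from sklearn.tree. This is a sketch under stated assumptions: the tree is fitted on all 14 rows for brevity, and random_state=0 is an arbitrary seed.

```python
import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

data = {
    'Outlook': ['Sunny', 'Sunny', 'Overcast', 'Rainy', 'Rainy', 'Rainy', 'Overcast',
                'Sunny', 'Sunny', 'Rainy', 'Sunny', 'Overcast', 'Overcast', 'Rainy'],
    'Temperature': ['Hot', 'Hot', 'Hot', 'Mild', 'Cool', 'Cool', 'Cool',
                    'Mild', 'Cool', 'Mild', 'Mild', 'Mild', 'Hot', 'Mild'],
    'Humidity': ['High', 'High', 'High', 'High', 'Normal', 'Normal', 'Normal',
                 'High', 'Normal', 'Normal', 'Normal', 'High', 'Normal', 'High'],
    'Wind': ['Weak', 'Strong', 'Weak', 'Weak', 'Weak', 'Strong', 'Strong',
             'Weak', 'Weak', 'Weak', 'Strong', 'Strong', 'Weak', 'Strong'],
    'PlayTennis': ['No', 'No', 'Yes', 'Yes', 'Yes', 'No', 'Yes',
                   'No', 'Yes', 'Yes', 'Yes', 'Yes', 'Yes', 'No'],
}
df = pd.get_dummies(pd.DataFrame(data),
                    columns=['Outlook', 'Temperature', 'Humidity', 'Wind'])
X = df.drop('PlayTennis', axis=1)
y = df['PlayTennis']

clf = DecisionTreeClassifier(random_state=0).fit(X, y)

# export_text returns the tree as a string of nested if/else style rules.
rules = export_text(clf, feature_names=X.columns.tolist())
print(rules)
```

Each leaf line ends with the predicted class, and each indented level corresponds to one split on an indicator column.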
