Decision Tree
The initial dataset preprocessing included handling missing values, particularly in the Age and
Embarked columns, where mean imputation and mode imputation were applied, respectively.
To prepare categorical variables, we used one-hot encoding for the "Sex" and "Embarked"
columns, creating separate binary columns and avoiding arbitrary numerical assignments that
might mislead the model. Non-predictive columns, such as PassengerID, Name, Ticket, and
Cabin, were removed from the feature set, as they do not directly influence survival outcomes.
Thus, the selected feature set consisted of Pclass, Sex, Age, Parch, Fare, and Embarked.
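The preprocessing steps above can be sketched with pandas. This is a minimal illustration on a hypothetical toy frame mirroring the named Titanic columns, not the study's actual code; the real pipeline would load the full dataset instead.

```python
import pandas as pd

# Hypothetical toy frame with the columns named in the text;
# stands in for the real Titanic dataset.
df = pd.DataFrame({
    "PassengerId": [1, 2, 3, 4],
    "Name": ["A", "B", "C", "D"],
    "Ticket": ["t1", "t2", "t3", "t4"],
    "Cabin": [None, "C85", None, "B28"],
    "Pclass": [3, 1, 3, 1],
    "Sex": ["male", "female", "female", "male"],
    "Age": [22.0, 38.0, None, 35.0],
    "Parch": [0, 0, 0, 0],
    "Fare": [7.25, 71.28, 7.92, 53.10],
    "Embarked": ["S", "C", None, "S"],
})

# Mean imputation for Age, mode imputation for Embarked.
df["Age"] = df["Age"].fillna(df["Age"].mean())
df["Embarked"] = df["Embarked"].fillna(df["Embarked"].mode()[0])

# Drop the non-predictive identifier columns.
df = df.drop(columns=["PassengerId", "Name", "Ticket", "Cabin"])

# One-hot encode the categorical columns into binary indicators.
df = pd.get_dummies(df, columns=["Sex", "Embarked"])
```

After these steps the frame contains only the selected features as numeric or binary columns, ready for model fitting.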
Two decision tree models were implemented: one based on the Gini Index and the other on Gain
Ratio. The models were trained on 80% of the data and evaluated on the remaining 20%. Key
performance metrics—accuracy, precision, recall, and F1-score—were used to assess each
model’s effectiveness.
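A scikit-learn sketch of this setup, under stated assumptions: the data here is synthetic (the real study used the Titanet features listed above), and scikit-learn offers "gini" and "entropy" (information gain) criteria rather than Gain Ratio itself, so "entropy" serves only as an approximate stand-in for the Gain Ratio model.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, f1_score

# Synthetic stand-in for the six selected features and survival labels.
rng = np.random.default_rng(0)
X = rng.random((200, 6))
y = (X[:, 1] + X[:, 4] > 1).astype(int)

# 80% train / 20% test split, as in the study.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# Note: "entropy" is information gain, not C4.5's gain ratio;
# a faithful Gain Ratio tree would need a custom implementation.
for criterion in ("gini", "entropy"):
    tree = DecisionTreeClassifier(criterion=criterion, random_state=42)
    tree.fit(X_train, y_train)
    pred = tree.predict(X_test)
    print(criterion,
          round(accuracy_score(y_test, pred), 3),
          round(f1_score(y_test, pred), 3))
```

Precision and recall can be reported the same way via `precision_score` and `recall_score` from `sklearn.metrics`.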
In conclusion, both the Gini Index and Gain Ratio are effective splitting criteria for decision tree
classification on this dataset, with minimal differences in performance; however, the Gini Index
model slightly outperformed the Gain Ratio model, making it a marginally better choice for
survival prediction in this case. The Gini Index model achieved an accuracy of 0.810, with
precision, recall, and F1-score all at 0.770. In comparison, the Gain Ratio model reached an
accuracy of 0.804, with a precision of 0.760, recall of 0.770, and F1-score of 0.765. Future work
could explore ensemble methods like random forests to potentially increase accuracy and
generalizability. This study highlights the utility of decision trees in survival analysis and their
adaptability to different criteria, offering a foundational approach for further predictive analytics
in similar datasets.
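The random forest extension suggested above could be sketched as follows; again the data is a synthetic stand-in, and the hyperparameters shown are illustrative defaults rather than tuned values.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Synthetic stand-in for the six selected features and survival labels.
rng = np.random.default_rng(0)
X = rng.random((200, 6))
y = (X[:, 1] + X[:, 4] > 1).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# An ensemble of Gini-based trees; bagging plus per-split feature
# subsampling typically reduces the variance of a single tree.
forest = RandomForestClassifier(n_estimators=100, criterion="gini",
                                random_state=42)
forest.fit(X_train, y_train)
print("accuracy:", round(accuracy_score(y_test, forest.predict(X_test)), 3))
```

Comparing this ensemble's held-out accuracy against the single-tree results would directly test the generalizability claim.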