DS_7
A limitation of bagging is that the same greedy algorithm is used to create each tree, so the same or very similar split points are likely to be chosen in each tree, making the trees very similar (the trees will be correlated). This, in turn, makes their predictions similar, undermining the variance reduction that was originally sought.
We can force the decision trees to be different by limiting the features (columns) that the greedy algorithm can evaluate at each split point when creating the tree. This is called the Random Forest algorithm.
Like bagging, multiple samples of the training dataset are taken and a different tree trained on each. The difference
is that at each point a split is made in the data and added to the tree, only a fixed subset of attributes can be
considered.
For classification problems, the type of problems we will look at in this tutorial, the number of attributes to be
considered for the split is limited to the square root of the number of input features.
num_features_for_split = sqrt(total_input_features)
The result of this one small change is trees that are more different from each other (uncorrelated), resulting in predictions that are more diverse and a combined prediction that often performs better than a single tree or bagging alone.
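As a minimal sketch of the sqrt rule above, the snippet below computes the per-split feature budget for a hypothetical dataset (the feature count 16 is an illustrative assumption, not from the original text):

```python
import math

# Hypothetical number of input features, for illustration only.
total_input_features = 16

# Square-root rule for classification: features considered per split.
num_features_for_split = int(math.sqrt(total_input_features))

print(num_features_for_split)  # -> 4
```

In scikit-learn this same rule is applied by passing max_features="sqrt" to RandomForestClassifier, so the budget does not need to be computed by hand.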
6. Laboratory Exercise
Procedure
i. Use google colab for programming.
ii. Import required packages.
iii. Demonstrate random forest classifier for any given dataset.
iv. Add relevant comments in your programs and execute the code. Test it for various cases.
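The procedure above can be sketched as follows. This is one possible demonstration, assuming scikit-learn is available (it is preinstalled in Google Colab) and using the built-in Iris dataset as an example; any classification dataset would work:

```python
# Step ii: import required packages.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# Step iii: load an example dataset (Iris, bundled with scikit-learn).
X, y = load_iris(return_X_y=True)

# Hold out 30% of the data to test the trained model.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42)

# Build the forest: 100 trees, each split limited to sqrt(n_features)
# candidate features, as described in the theory section.
clf = RandomForestClassifier(
    n_estimators=100, max_features="sqrt", random_state=42)
clf.fit(X_train, y_train)

# Step iv: evaluate on unseen data to test the classifier.
accuracy = accuracy_score(y_test, clf.predict(X_test))
print(f"Test accuracy: {accuracy:.3f}")
```

Varying test_size, n_estimators, or the dataset itself covers the "test it for various cases" step.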
Post-Experiments Exercise:
A. Extended Theory:
a. Write real life applications of Random Forest Classifier.
B. Conclusion:
1. Write what was performed in the program(s).
2. What is the significance of the program, and what objective is achieved?
C. References:
[1] https://machinelearningmastery.com/implement-random-forest-scratch-python/
[2] https://www.geeksforgeeks.org/random-forest-classifier-using-scikit-learn/
[3] https://www.analyticsvidhya.com/blog/2021/06/understanding-random-forest/