Presentation 7 (2)

Uploaded by

blueorange630


Optimizing Machine Learning Pipeline
Muhammad Omer - i220572
Shariq Usman - i220447
Azeem Chaudary - i220479
Introduction to Problems
• Data Imbalance:
  o Class 0 (60.2%), Class 1 (39.8%)
• Missing Values:
  o 18.5% of rows have missing values
• Distribution Issues:
  o Skewed features (e.g., feature_4)
  o Outliers (e.g., feature_7)
• Correlation Issues:
  o Weak target correlation
• Dataset too small
• GPU slowed things down
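The data checks listed above can be sketched with the Python standard library alone. This is a minimal illustration, not the project's actual code: the helper names (`class_shares`, `missing_row_rate`, `skewness`, `iqr_outliers`), the use of `None` for missing values, and the toy data are all assumptions.

```python
from collections import Counter
from statistics import mean, pstdev, quantiles


def class_shares(labels):
    """Return each class's share of the rows as a fraction."""
    counts = Counter(labels)
    return {cls: n / len(labels) for cls, n in counts.items()}


def missing_row_rate(rows):
    """Fraction of rows containing at least one missing (None) value."""
    return sum(any(v is None for v in row) for row in rows) / len(rows)


def skewness(values):
    """Population skewness; a positive value means a long right tail."""
    mu, sigma = mean(values), pstdev(values)
    return sum(((v - mu) / sigma) ** 3 for v in values) / len(values)


def iqr_outliers(values, k=1.5):
    """Values outside [Q1 - k*IQR, Q3 + k*IQR] (Tukey's rule)."""
    q1, _, q3 = quantiles(values, n=4)
    spread = q3 - q1
    return [v for v in values if v < q1 - k * spread or v > q3 + k * spread]


# Toy data (hypothetical, not the project's dataset).
labels = [0, 0, 0, 1, 1]
rows = [(1.0, 2.0), (None, 3.0), (4.0, 5.0), (6.0, 7.0), (8.0, 9.0)]
feature = [1, 2, 2, 3, 3, 3, 4, 50]

print(class_shares(labels))     # {0: 0.6, 1: 0.4}
print(missing_row_rate(rows))   # 0.2
print(skewness(feature) > 0)    # True: right-skewed
print(iqr_outliers(feature))    # [50]
```

On a real dataset the same checks would normally run per column; a strongly positive skewness suggests a log transform, and IQR outliers suggest clipping or robust scaling.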
Proposed Solution
1. Class Imbalance:
• Applied undersampling
2. Missing Data:
• Filled using the median
3. Feature Engineering:
• Log-scaled Feature 4
• Squared Feature 4
• Sine & cosine transform on Feature 6
• Clustered adjusted Feature 4 & Feature 7
[Diagram: ML preprocessing pipeline: undersampling, median filling, periodic normalization, clustering into a new feature]
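The preprocessing steps above can be sketched as follows. This is a standard-library illustration under stated assumptions, not the authors' implementation: the function names, the `None` convention for missing values, the `period` parameter for the sine/cosine transform, and the use of `log1p` (which assumes Feature 4 is non-negative) are all hypothetical. The clustering step is left out because the slides do not say which algorithm was used.

```python
import math
import random
from statistics import median


def undersample(rows, labels, seed=0):
    """Randomly drop rows so every class keeps as many rows as the smallest class."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = min(len(group) for group in by_class.values())
    out_rows, out_labels = [], []
    for label, group in by_class.items():
        for row in rng.sample(group, target):
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def fill_median(column):
    """Replace None entries with the median of the observed values."""
    fill = median(v for v in column if v is not None)
    return [fill if v is None else v for v in column]


def engineer(f4, f6, period):
    """Derived features: log scale and square of Feature 4, sin/cos of Feature 6."""
    angle = 2 * math.pi * f6 / period  # assumed period of the cyclic feature
    return {
        "f4_log": math.log1p(f4),  # assumes f4 >= 0
        "f4_sq": f4 ** 2,
        "f6_sin": math.sin(angle),
        "f6_cos": math.cos(angle),
    }


rows, labels = undersample([[1], [2], [3], [4], [5]], [0, 0, 0, 1, 1])
print(sorted(labels))                      # [0, 0, 1, 1]
print(fill_median([1.0, None, 3.0, 5.0]))  # [1.0, 3.0, 3.0, 5.0]
print(engineer(f4=0.0, f6=6.0, period=24.0))
```

Encoding a cyclic feature as a sine/cosine pair keeps values at the two ends of the cycle close together, which a raw numeric encoding would not.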
Parallelization Approach
• Multi-threading: Ran tasks on all CPU cores for preprocessing, training, and testing wherever parallel processing was possible.
• Broke data into chunks: Processed pieces at the same time.
• Added safety: Switched to single mode if parallel failed.
[Diagram: Parallelize tasks: multi-threading, split data into chunks, added safety]
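The chunk-and-fallback pattern described above might look like the sketch below. It is an assumption-laden illustration rather than the project's code: `split_chunks`, `process_chunk`, and `run_parallel` are hypothetical names, and the per-chunk work is a placeholder. It uses `ThreadPoolExecutor` to match the slide's "multi-threading"; for CPU-bound pure-Python work, a process pool would usually be needed to get real parallelism in CPython.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def split_chunks(items, n_chunks):
    """Split items into at most n_chunks contiguous, near-equal pieces."""
    n_chunks = max(1, min(n_chunks, len(items)))
    size, extra = divmod(len(items), n_chunks)
    chunks, start = [], 0
    for i in range(n_chunks):
        end = start + size + (1 if i < extra else 0)
        chunks.append(items[start:end])
        start = end
    return chunks


def process_chunk(chunk):
    """Hypothetical per-chunk work: here, square every value."""
    return [x * x for x in chunk]


def run_parallel(items, worker=process_chunk, n_workers=None):
    """Process chunks on a thread pool; fall back to a serial loop on failure."""
    n_workers = n_workers or os.cpu_count() or 1
    chunks = split_chunks(items, n_workers)
    try:
        with ThreadPoolExecutor(max_workers=n_workers) as pool:
            results = list(pool.map(worker, chunks))  # map preserves chunk order
    except Exception:
        # Safety net: redo all chunks in single-threaded mode.
        results = [worker(chunk) for chunk in chunks]
    return [value for part in results for value in part]


print(split_chunks(list(range(7)), 3))  # [[0, 1, 2], [3, 4], [5, 6]]
print(run_parallel(list(range(5))))     # [0, 1, 4, 9, 16]
```

Because `pool.map` returns results in submission order, the merged output matches what a serial run would produce, so the fallback path is a drop-in replacement.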
Model Performance Comparison
• Models
  o Random Forest (RFC): Works well with small data, handles class imbalance reasonably well
  o XGBoost: Uses L1/L2 regularization and can capture nonlinear patterns
• Performance
  o RFC: accuracy 59.17%, F1 53.90%
  o XGBoost: accuracy 59.38%, F1 47.71%
• Best:
  o Accuracy: XGBoost (59.38%)
  o Speedup: RFC (88.69%)
• Resource Use:
  o Memory: 486.58 MB
  o CPU: 0.00%
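The results above rank the two models differently depending on the metric: XGBoost has the higher accuracy, while RFC has the higher F1. The sketch below shows how that can happen, using plain-Python definitions of both metrics. The labels and predictions are toy data invented for illustration, not the project's outputs, and the function names are hypothetical.

```python
def accuracy(y_true, y_pred):
    """Share of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1_score(y_true, y_pred, positive=1):
    """F1 for the positive class: 2*TP / (2*TP + FP + FN)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy labels with a 60/40 class split (hypothetical data).
y_true   = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
cautious = [0, 0, 0, 0, 0, 0, 1, 0, 0, 0]  # rarely predicts class 1
balanced = [0, 0, 0, 0, 1, 1, 1, 1, 1, 0]  # predicts class 1 more often

print(accuracy(y_true, cautious), f1_score(y_true, cautious))  # 0.7 0.4
print(accuracy(y_true, balanced), f1_score(y_true, balanced))  # 0.7 0.6666666666666666
```

Both toy models reach the same accuracy, but the one that misses most positive rows scores a much lower F1, which is why F1 is worth reporting alongside accuracy on imbalanced data.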
Conclusion
• Achieved 80% faster processing, with a best accuracy of 59.38%.
• Worked well on old systems with low resource use.

Future Work:
• Test with larger data for better results.
• Explore advanced models for higher accuracy.
