
Machine Learning

Overview
Techniques and Applications
By
Firas Gerges

1
Outline
• What is Machine Learning
• Machine Learning vs. Traditional Model
• Steps in Machine Learning
• Machine Learning Types and Algorithms
• Machine Learning Use Case

2
Machine Learning

• Machine Learning (ML), a subfield of Artificial Intelligence (AI), is a set of methods and approaches that leverage algorithms and data to imitate intelligent human behavior: learning from experience.

• ML learns from data (situation => outcome) to predict the outcome of a new given situation.

• Situation: Features/Attributes/Predictors/Input
• Outcome: Target/Class/Prediction/Predictand/Output

3
The Learning Problem

• Suppose we observe the output space Y and the input space X

• Task is to find a relationship/mapping function f between X and Y:

• Y = f(X) + ε

• (ε is a random error (noise) term, independent of X)

4
The Learning Problem

• We cannot compute f, but we can estimate it by learning from data.

• Role of Machine Learning is to construct f̂ as an estimate of f by learning from the data.

• Two main objectives:

• Prediction: use f̂ on X to compute a prediction of Y (Ŷ = f̂(X))
• Inference: use f̂ to study the relationship between X and Y

5
The Learning Problem: How to estimate f

• ML techniques are used to estimate f. Different ML techniques have different formulations and assumptions around the form and type of f (linear, non-linear, decision tree, etc.)

• Estimate f:
1. Construct an observed set of “training” data: {(x₁, y₁), …, (xₙ, yₙ)}
2. Use “training” data as input to a ML technique to construct f̂
6
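The two steps above can be sketched in code. This is a minimal illustration using scikit-learn (from the deck's recommended readings) on made-up data; the linear form of f and all values here are assumptions, not part of the slides.

```python
# Step 1: construct a "training" set {(x_i, y_i)} from a known f plus noise.
# Step 2: feed it to an ML technique to construct f_hat, an estimate of f.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))                 # input space X
y = 3.0 * X[:, 0] + 2.0 + rng.normal(0, 0.5, 100)     # Y = f(X) + epsilon

f_hat = LinearRegression().fit(X, y)                  # f_hat learned from data
prediction = f_hat.predict([[5.0]])                   # prediction of Y for a new X
```

With enough data the learned coefficients approach the true f, so the prediction at X = 5 lands near 3·5 + 2 = 17.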
ML vs. Traditional Computing

7
Machine Learning Types

• Supervised Learning:
• Regression
• Classification
• Unsupervised Learning:
• Clustering
• Association Analysis
• Reinforcement Learning

8
Supervised vs.
Unsupervised Learning

Morimoto, J., & Ponton, F. Virtual reality in biology: could we become virtual naturalists? Evo Edu Outreach 14, 7 (2021). https://doi.org/10.1186/s12052-021-00147-x

9
How to Perform Machine Learning

10
Steps to Perform Machine Learning

1. Objectives and Goals Definition
2. Data Acquisition
3. Processing and Feature Engineering
4. ML Algorithm Selection
5. Model Training/Learning
6. Model Testing
7. Model Deployment

11
Objective and Goals Definition

• We often don't just start with the training data set and plug in a learning algorithm to find the predictor
• We must define and formulate the problem: Input and Output

• Problem statement: Forecast the river discharge

• Questions:
• Predicting what? Continuous or discrete? Quantitative or qualitative?
• Prediction/Forecasting or Inference?
• Predicting or Forecasting?
• Site-specific or general model?

• Example of prediction: Use temperature, wind speed, humidity, etc. at time t to predict river discharge at time t
• Example of forecasting: Use data known only in the past (t-x) to predict discharge at time t

12
Data Acquisition
• Define what data to use as input to the model:
• Data type: textual, images, numerical, etc. (or
combination)
• Cost: Data collection is often costly; researchers should
think of what kind of data would be most useful to solve
the problem. This requires “domain knowledge”.
• Data availability

• Be careful: Seasonality and temporal dependencies
13
Processing and Feature Engineering

Feature engineering is the process of selecting, manipulating, and transforming raw data into features that can be used in Machine Learning.

Some steps we often see in feature engineering:
• Extracting features from images
• Textual to numerical features transformation, embedding and encoding
• Normalization/Standardization
• Data cleaning, handling missing values, removing outliers, etc.
• Data analysis to keep/drop features

14
Machine Learning Algorithms: Supervised Learning

• Problem is defined
• Constraints are defined
• Data are collected and processed
• Next: Selecting ML technique to train

Classical ML techniques:
• Linear/Logistic Regression
• K-Nearest Neighbors (KNN)
• Support Vector Machines (SVM)
• Decision Trees (DT)
• Random Forest (RF)
• Neural Networks (NN) / Deep Learning
15
Linear Regression
• Linear Regression assumes that there is a linear relationship present between input features and the output.
• It aims to find the best-fitting line (or plane) that describes two or more variables.

• Linear Regression is used to predict continuous output


• Logistic Regression is used for classification

16
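The continuous-vs-discrete distinction above can be shown side by side: Linear Regression fitted to a continuous target and Logistic Regression fitted to 0/1 class labels. The toy data and scikit-learn usage are assumptions for illustration.

```python
# Linear Regression: continuous output; Logistic Regression: classification.
import numpy as np
from sklearn.linear_model import LinearRegression, LogisticRegression

X = np.arange(10).reshape(-1, 1)

y_cont = 2.0 * X[:, 0] + 1.0              # continuous target on an exact line
lin = LinearRegression().fit(X, y_cont)   # recovers slope 2 and intercept 1

y_class = (X[:, 0] >= 5).astype(int)      # discrete target: 0/1 labels
log = LogisticRegression().fit(X, y_class)
```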
K-Nearest Neighbors

• KNN is an instance-based ML technique that does not require an explicit training phase.
• Idea is to label the new instance (data case) based on the top K nearest instances:
• Nearest = Distance Measure

17
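The idea above, labeling a new instance by a vote of its K nearest neighbors, can be sketched as follows; the two toy clusters and the choice of scikit-learn are assumptions for illustration.

```python
# KNN: "training" just stores the data; prediction is a majority vote
# of the K nearest instances under a distance measure (Euclidean by default).
from sklearn.neighbors import KNeighborsClassifier

X = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
y = [0, 0, 0, 1, 1, 1]

knn = KNeighborsClassifier(n_neighbors=3).fit(X, y)
label = knn.predict([[4.5, 5.0]])   # its 3 nearest neighbors are all class 1
```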
Support Vector
Machines

• SVM maps the data into a high-dimensional space and tries to find a hyperplane that separates the cases based on their class label
• SVM works by maximizing the width of the margin separating the different classes, in order to minimize the generalization error of the chosen hyperplane

18
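A minimal sketch of the above: an SVM fitted to two small clusters. The RBF kernel (which performs the implicit mapping to a higher-dimensional space), the toy data, and scikit-learn are all assumptions here.

```python
# SVM: find a maximum-margin separating hyperplane; the kernel handles
# the mapping into a higher-dimensional space implicitly.
from sklearn.svm import SVC

X = [[0, 0], [1, 1], [1, 0], [4, 4], [5, 5], [4, 5]]
y = [0, 0, 0, 1, 1, 1]

svm = SVC(kernel="rbf", C=1.0).fit(X, y)
pred = svm.predict([[4.5, 4.5]])   # falls on the class-1 side of the boundary
```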
Decision Tree Learning
Decision tree learning is a method for approximating target functions, in which the learned function is represented by a decision tree.

These learning methods are among the most popular, due to their efficiency and their white-box nature.

A decision tree is simply a series of sequential decisions made to reach a specific result.

Mitchell, Tom M. Machine Learning. Vol. 1. No. 9. New York: McGraw-Hill, 1997.

Tree is built following concepts from information theory related to entropy and information gain.

19
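The "series of sequential decisions" above can be sketched with a tiny tree; the entropy criterion mirrors the information-gain idea mentioned on the slide. The one-feature toy data and scikit-learn are assumptions for illustration.

```python
# Decision tree: splits chosen to maximize information gain
# (criterion="entropy"), yielding a sequence of threshold decisions.
from sklearn.tree import DecisionTreeClassifier

X = [[1], [2], [3], [10], [11], [12]]
y = [0, 0, 0, 1, 1, 1]

tree = DecisionTreeClassifier(criterion="entropy", random_state=0).fit(X, y)
pred = tree.predict([[2], [11]])   # routed down the tree to leaf labels
```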
Random Forest vs. Decision Tree

• Random Forest is a tree-based algorithm that leverages multiple decision trees when making a prediction
• Random Forest combines the output of multiple (randomly created) Decision Trees to predict the final output
• Because of this randomness, it will generate different results each time (unless the random seed is fixed)
20
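The contrast above can be sketched as follows: one decision tree versus a forest that combines many randomized trees, with a fixed seed making the otherwise random results reproducible. The synthetic dataset and scikit-learn are assumptions for illustration.

```python
# Random Forest: many randomly created trees, outputs combined;
# fixing random_state removes the run-to-run randomness.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=200, n_features=8, random_state=0)

tree = DecisionTreeClassifier(random_state=0).fit(X, y)        # a single tree
forest = RandomForestClassifier(n_estimators=50,
                                random_state=0).fit(X, y)      # 50 combined trees
same_seed = RandomForestClassifier(n_estimators=50,
                                   random_state=0).fit(X, y)   # identical seed
```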
Neural Networks

• It consists of units connected through weighted edges
• It learns weights that minimize the error between predictions and observations

21
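A minimal sketch of the above: a small network of units with weighted connections, trained so that its weights minimize prediction error. scikit-learn's MLPClassifier, the single hidden layer, and the toy data are all assumptions for illustration.

```python
# A tiny neural network: one hidden layer of 8 units; training adjusts
# the connection weights to reduce the error on the training data.
from sklearn.neural_network import MLPClassifier

X = [[0.0], [1.0], [2.0], [8.0], [9.0], [10.0]]
y = [0, 0, 0, 1, 1, 1]

nn = MLPClassifier(hidden_layer_sizes=(8,), solver="lbfgs",
                   max_iter=2000, random_state=0).fit(X, y)
```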
How to Choose a ML Algorithm

• Linear Regression:
• Pros:
• Simple and effective
• No parameter tuning is necessary
• Feature importance (scale features first)
• Performs well on linear data
• Fast
• Cons:
• Poor performance on non-linear data
• Poor performance with irrelevant and highly correlated features
• Requires feature engineering to only keep relevant data

22
How to Choose a ML Algorithm

• K-Nearest Neighbors:
• Pros:
• Simple and easy to understand
• Only one parameter to tune
• No assumption about the data
• Can easily be changed to handle new data
• Cons:
• Poor performance on data with a lot of features
• Requires data scaling
• Very sensitive to outliers
• Poor performance on imbalanced data

23
How to Choose a ML Algorithm

• Support Vector Machines:
• Pros:
• Suitable for data with high dimensions
• Impact of outliers is minimal
• Often outperforms Linear Regression
• Cons:
• Very slow to train
• Sensitive to noise (overlapped cases)
• Selection of hyperparameters (and kernel) is very important

24
How to Choose a ML Algorithm

• Random Forests:
• Pros:
• Good performance on imbalanced datasets
• Handles huge amounts of data with high dimensionality
• Impact of outliers is minimal
• Feature importance
• Irrelevant features won't strongly affect performance (unlike a single decision tree)
• Cons:
• Large number of trees can make the algorithm too slow and ineffective for real-time predictions

25
How to Choose a ML Algorithm

• Neural Networks:
• Pros:
• Good for nonlinear data with a large number of inputs
• Once trained, the predictions are very fast
• Can explore deep, hidden relationships in the data
• Cons:
• Black box
• Requires a lot of training data
• Computationally expensive
• Large space of possible architectures and parameters

26
Model Training

Now it is time to train the model.

• Original data set is divided into training and testing sets

• Training data is used to train and cross-validate the model:
• Feature analysis and scaling
• Model training
• Parameters and architecture tuning

27
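The split-then-cross-validate workflow above can be sketched as follows; the synthetic dataset, the Random Forest choice, and scikit-learn are assumptions for illustration.

```python
# Divide the original data into training and testing sets, then
# cross-validate on the training portion only; the test set stays unseen
# until the evaluation phase.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score, train_test_split

X, y = make_classification(n_samples=300, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)               # hold out 20% for testing

scores = cross_val_score(RandomForestClassifier(random_state=0),
                         X_train, y_train, cv=5)       # 5-fold cross-validation
model = RandomForestClassifier(random_state=0).fit(X_train, y_train)
```

The held-out test set is only touched afterwards, in the evaluation phase.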
Model Testing/Evaluation

Once a model shows good performance in the training/validation step, testing data is used.

This is called the Evaluation Phase: testing the performance of the model on new, unseen data.

28
ML Advantages, Challenges, and Limitations

Advantages:
• Identifies trends and patterns
• Often needs no human intervention
• Always learning and improving
• Handles multi-dimensional and heterogeneous data
• Wide applications

Limitations:
• Domain expertise is often required
• Challenges in interpreting results
• Requires time and resources
• High error susceptibility
• High data dependency

29
Recommended Readings
Maini, V., & Sabri, S. (2017). Machine learning for humans. Online: https://medium.com/machine-learning-for-humans
Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow. O'Reilly Media, Inc.
Mitchell, T. M. (1997). Machine learning (Vol. 1, No. 9). New York: McGraw-Hill.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
30
