
035.001 Spring, 2024
Digital Computer Concept and Practice

Supervised Learning (3)

Soohyun Yang
College of Engineering
Department of Civil and Environmental Engineering
Types of ML techniques – All learning is learning!
Our scope : Supervised learning

 Supervised learning – “Presence of labels”
• Classification : Spam classification, Advertisement popularity, Face recognition
• Regression
 Unsupervised learning – “Absence of labels”
• Clustering : Recommender systems (YT), Buying habits (group customers), Grouping user logs
 Reinforcement learning – “Behavior-driven : feedback loop”
• Learning to play games (AlphaGo), Industrial simulation, Resource management

https://towardsdatascience.com/what-are-the-types-of-machine-learning-e2b9e5d1756f
Regression
 A statistical method to determine the relationship between a dependent variable (target) and one or more independent variables (features), predicting a target value on a continuous scale for given new data.

 Algorithms in our scope
• K-nearest neighbors (KNN)
• Linear regression (LR) => Simple, Polynomial, Multiple
• Ridge regression – Regularization
• Lasso regression – Regularization
• Decision trees === // Ensemble // ===> Random forest
Linear Regression (LR) models
 Describe a continuous target variable as a linear combination of one
or more features.
 Aim to find the set of model parameters (coefficients and y-intercept) that minimizes the sum of squared residuals (a.k.a. offsets).
 Example : [Figure: a single feature vs. the target, with the fitted regression line; Raschka & Mirjalili (2019)]
LR models (con’t)
 Simple LR [단순 선형회귀]:
An LR model with a single feature variable x.
 Polynomial LR [다항 선형회귀]:
An LR model with an n-th degree polynomial in one feature x.

https://www.javatpoint.com/machine-learning-polynomial-regression
LR models (con’t)
 Multiple LR [다중 선형회귀]:
An LR model using more than one feature (xn; n > 1) to predict a target y.

y = w0 + w1 x1 + w2 x2 + … + wn xn

https://www.shiksha.com/online-courses/articles/multiple-linear-regression/
LR models (con’t)
 Feature engineering [특성 공학]:
The process of selecting, manipulating, and transforming raw
features into desired features to obtain better performance in
supervised learning.
 Example: 2nd-degree with two raw features (x1 and x2)
y = w0 + w1 x1 + w2 x2 + w3 x1 x2 + w4 x1² + w5 x2²
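For illustration only (not from the lecture code), scikit-learn's PolynomialFeatures class can generate exactly these engineered terms from the two raw features; the sample values below are made up:

import numpy as np
from sklearn.preprocessing import PolynomialFeatures

# Two raw features (x1, x2) for three made-up samples
X_raw = np.array([[1.0, 2.0],
                  [2.0, 3.0],
                  [3.0, 5.0]])

# 2nd-degree expansion: x1, x2, x1^2, x1*x2, x2^2 (no bias/intercept column)
poly = PolynomialFeatures(degree=2, include_bias=False)
X_eng = poly.fit_transform(X_raw)

print(poly.get_feature_names_out(["x1", "x2"]))  # ['x1' 'x2' 'x1^2' 'x1 x2' 'x2^2']
print(X_eng)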
Exercise 1: Simple LR approach
 Let's apply the simple LR algorithm to solve a regression problem.
 1. Data preparation & import : InClassData_Traffic_Reg.csv
[Input data table: samples × (Feature 1, Feature 2, Feature 3, Target)]
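A minimal sketch of step 1, assuming pandas is available; the column names are expected to match the table above, but treat the exact names as assumptions until the file is inspected:

import pandas as pd

# Step 1: load the in-class traffic dataset
df = pd.read_csv("InClassData_Traffic_Reg.csv")

print(df.shape)   # (number of samples, number of columns)
print(df.head())  # first rows: Feature 1-3 and the Target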
Exercise 1: Simple LR approach (con’t)
 Let's apply the simple LR algorithm to solve a regression problem.
 1*. Visualize the whole data to understand it easily.
[Figure: Traffic volume distribution]
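A possible sketch of step 1*, assuming matplotlib is available and that the target column holding the traffic volume is named "Target" (an assumption):

import matplotlib.pyplot as plt

# Step 1*: quick look at the target distribution
df["Target"].hist(bins=20)            # column name is an assumption
plt.xlabel("Traffic volume")
plt.ylabel("Count")
plt.title("Traffic volume distribution")
plt.show()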
Exercise 1: Simple LR approach (con’t)
 2. Select a specific feature for the
simple LR analysis (here, “Feature 1”)
[Input data table: samples × (Feature 1, Target)]
Exercise 1: Simple LR approach (con’t)
 3. Data separation into the training and test sets
• random_state [integer] : A parameter (seed) for the random number generator.
• Stratification is NOT needed for a regression problem.
 4. Reshape the 1-D training sets as 2-D arrays.
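A minimal sketch of steps 2-4; the column names, test_size, and random_state below are assumptions, since the slide does not state them:

from sklearn.model_selection import train_test_split

# Step 2: single feature and the target (column names are assumptions)
X = df["Feature 1"].to_numpy()
y = df["Target"].to_numpy()

# Step 3: split into training and test sets; no stratification for regression
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42)

# Step 4: reshape the 1-D feature arrays into 2-D (n_samples, 1) arrays
X_train = X_train.reshape(-1, 1)
X_test = X_test.reshape(-1, 1)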
Exercise 1: Simple LR approach (con’t)
 5. Import ‘LinearRegression’ class
and create its instance.
 6. Fit the regression model using
the training set (fit method).
 7. Make predictions on the test
data (predict method).
 8. Evaluate the model's performance (score method => via R², the coefficient of determination [결정계수]).
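A minimal sketch of steps 5-8, continuing from the arrays prepared above:

from sklearn.linear_model import LinearRegression

# Step 5: create the model instance
lr = LinearRegression()

# Step 6: fit on the training set
lr.fit(X_train, y_train)

# Step 7: predict on the test data
y_pred = lr.predict(X_test)

# Step 8: evaluate via R^2 (coefficient of determination)
print("Test R^2:", lr.score(X_test, y_test))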
Exercise 1: Simple LR approach (con’t)
 9. Check the resultant coefficients
and the y-intercept
• coef_ : Estimated coefficients for the
linear regression problem, from the
highest to the 1st orders. => Array type
• intercept_ : Estimated y-intercept.
=> Float or array type
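A brief check of step 9, continuing the sketch above:

# Step 9: fitted parameters
print(lr.coef_)       # array of estimated coefficients
print(lr.intercept_)  # estimated y-intercept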

Y = 191.96 X – 441.7
Does the regression make sense?
Exercise 2: Polynomial LR approach
 1. Let’s introduce a second-degree
polynomial function, defined as:
Target = w0 + w1 F + w2 F²
(where F = Feature 1)
>> Note : We intend to make ‘two’ features, [F, F²]. A dataset should be defined to contain the newly formulated features.
 2. Import ‘LinearRegression’ class
and create its instance.
 3. Execute the Fit-Predict-Score
methods.
 4. Yield the model’s performance.
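A minimal sketch of Exercise 2, reusing X_train/X_test/y_train/y_test from the Exercise 1 sketch and stacking the two engineered features [F, F²] by hand:

import numpy as np
from sklearn.linear_model import LinearRegression

# Step 1: build the two features [F, F^2] from Feature 1
F_train, F_test = X_train[:, 0], X_test[:, 0]
X_train_poly = np.column_stack([F_train, F_train ** 2])
X_test_poly = np.column_stack([F_test, F_test ** 2])

# Steps 2-4: create, fit, predict, and score the polynomial LR model
poly_lr = LinearRegression()
poly_lr.fit(X_train_poly, y_train)
print("Test R^2:", poly_lr.score(X_test_poly, y_test))
print(poly_lr.coef_, poly_lr.intercept_)  # used on the next slide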
Exercise 2: Polynomial LR approach (con’t)
 5. Check the resultant coefficients and the y-intercept
=> T = 25.6 F² – 48.6 F + 43.9

 6. Visual examination
Exercise 3-1: Multiple LR approach (2nd-degree)
 1. Set multiple features (three features) to express all possible combinations.
 2. Data separation into the training and test sets
 3. Import the ‘PolynomialFeatures’ class and create its instance. (2nd-degree is the default option!)
• include_bias = False : The intercept term is not included in the output features.
 4. Create the transformed training & test sets (fit_transform method).
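A minimal sketch of steps 1-4, assuming the three feature column names and the same split settings as before:

from sklearn.model_selection import train_test_split
from sklearn.preprocessing import PolynomialFeatures

# Steps 1-2: all three features and the target (column names are assumptions)
X = df[["Feature 1", "Feature 2", "Feature 3"]].to_numpy()
y = df["Target"].to_numpy()
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Step 3: 2nd-degree polynomial feature generator (degree=2 is the default),
#         without the intercept column
poly = PolynomialFeatures(include_bias=False)

# Step 4: transformed training & test sets
X_train_poly2 = poly.fit_transform(X_train)
X_test_poly2 = poly.transform(X_test)
print(poly.get_feature_names_out())  # generated feature names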
Exercise 3-1: Multiple LR approach (con’t)
 5. Execute the Fit-Predict-Score
methods.
 6. Yield the model’s performance.

Same order of coefficients!


Exercise 3-2: Multiple LR approach (5th-degree)
 1. Set multiple features to express all possible combinations. => 55 variables!
 2. Data separation into the training and test sets
 3. Implement the ‘PolynomialFeatures’ class with the specific option “degree = 5” and create its instance.
 4. Create the transformed training & test sets (fit_transform method).
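A minimal sketch of Exercise 3-2, continuing from the split above; with degree = 5 the three raw features expand to 55 columns:

from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import PolynomialFeatures

# Step 3: 5th-degree expansion of the three raw features
poly5 = PolynomialFeatures(degree=5, include_bias=False)

# Step 4: transformed training & test sets
X_train_p5 = poly5.fit_transform(X_train)
X_test_p5 = poly5.transform(X_test)
print(X_train_p5.shape[1])  # 55 generated variables

lr5 = LinearRegression().fit(X_train_p5, y_train)
print("Train R^2:", lr5.score(X_train_p5, y_train))
print("Test R^2:", lr5.score(X_test_p5, y_test))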
Exercise 3-2: Multiple LR approach (5th-degree)
Summarized R²
 Simple LR model (with one feature, Feature 1)

 Polynomial LR model (2nd-degree with one feature, Feature 1)

 Multiple LR model (2nd-degree with three features, Features 1~3)

 Multiple LR model (5th-degree with three features, Features 1~3)


=> Extremely overfitted model to the training data
Regularization
 A technique used to reduce errors by fitting the function appropriately on the given training set and avoiding overfitting (i.e., it reduces model complexity).
 Is controlled by a ‘Hyperparameter [하이퍼파라미터]’
- A parameter that is not learned by the model, but assigned by the user.
(Our goal : To find the combination of weight coefficients that minimizes the cost function for the training data.)
 Techniques:
• Ridge regression – L2 regularization
• Lasso regression – L1 regularization
• Elastic Net regression – L1 and L2 regularization

[Figure: cost function with regularization; Raschka & Mirjalili (2019)]
Ridge regression
 An L2 penalized model where we simply add the squared sum of the
weights to our least-squares cost function.

Raschka & Mirjalili (2019);


 The greater the value of the hyperparameter λ
=> the stronger the regularization
=> the more the model's weights shrink.
>> Note : The y-intercept term, w0, is not regularized.
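For reference, the L2-penalized cost function described above can be written in its standard form (as in Raschka & Mirjalili, 2019) as:

J(w) = Σ_i ( y_i − ŷ_i )² + λ Σ_j w_j²
(sum over the training samples i and over the weights j = 1 … m, excluding w0)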
Example : Ridge regression

Default : Alpha = 1
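A minimal sketch of the Ridge example, reusing the 5th-degree feature sets from the Exercise 3-2 sketch (the names X_train_p5, X_test_p5 are assumptions from that sketch); scikit-learn's alpha plays the role of λ and defaults to 1:

from sklearn.linear_model import Ridge

ridge = Ridge(alpha=1.0)  # default alpha = 1
ridge.fit(X_train_p5, y_train)
print("Train R^2:", ridge.score(X_train_p5, y_train))
print("Test R^2:", ridge.score(X_test_p5, y_test))
print(ridge.coef_, ridge.intercept_)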
Example : Ridge regression (con’t)
Example : Ridge regression (con’t)
Lasso regression
 Lasso (Least Absolute Shrinkage and Selection Operator)
 Depending on the regularization strength, certain weights can become exactly zero, which makes Lasso useful for feature selection in supervised learning.

Raschka & Mirjalili (2019);
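Analogously to Ridge, the L1-penalized cost function can be written in its standard form as:

J(w) = Σ_i ( y_i − ŷ_i )² + λ Σ_j |w_j|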


Example : Lasso regression
Default : Alpha = 1
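A minimal sketch of the Lasso example, again reusing the 5th-degree feature sets from the Exercise 3-2 sketch; on unscaled polynomial features the solver may need a larger max_iter to converge:

from sklearn.linear_model import Lasso

lasso = Lasso(alpha=1.0)  # default alpha = 1
lasso.fit(X_train_p5, y_train)
print("Train R^2:", lasso.score(X_train_p5, y_train))
print("Test R^2:", lasso.score(X_test_p5, y_test))
print((lasso.coef_ != 0).sum(), "non-zero coefficients")  # Lasso zeroes out weak features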
Example : Lasso regression (con’t)
Example : Lasso regression (con’t)
Summarize the results
Polynomial features (5th-degree), plain LR :
Coefficients (5 per row, in the feature order listed below) :
 2.7E+02   3.2E+02  -1.1E+03   8.0E+02   1.4E+03
-1.0E+03   2.2E+03  -1.5E+03   3.4E+02   1.6E+03
 2.8E+03  -9.1E+02   4.8E+03  -1.1E+03   2.7E+02
 7.8E+03  -2.1E+03   1.7E+02  -1.8E+01  -6.8E+02
-8.6E+02   2.5E+01  -6.7E+02  -2.6E+02   1.1E+02
 7.7E+01   7.7E+01  -6.6E-01  -1.2E+01   9.1E+02
-4.4E+02   7.0E+01  -2.8E+00   3.7E-01   4.8E+03
 2.1E+03  -1.8E+03  -1.7E+03   2.9E+02   1.9E+02
-5.3E+03   2.0E+03  -2.5E+02  -2.9E+00  -5.3E+03
 2.3E+03  -4.0E+02   3.5E+01  -6.8E-01   4.5E+03
-1.8E+03   2.7E+02  -1.9E+01   4.0E-01  -1.1E-03
Intercept : -6.5E+04

Ridge Regression :
Coefficients (same feature order) :
 4.5E+00   1.3E+01  -7.6E+00   1.7E+01   1.6E+01
 6.2E+00   2.3E+01   1.6E+01  -1.9E+00   2.0E+01
 1.4E+01   1.1E+01   1.4E+01   1.3E+01   5.4E+00
 2.3E+01   2.3E+01   1.6E+01   2.1E+00   1.5E+01
 5.2E+00   8.3E+00   7.2E-01   4.6E+00   4.2E+00
 3.5E+00   7.9E+00   8.1E+00   3.0E+00   1.5E+01
 2.0E+01   2.1E+01   1.6E+01   4.5E+00   6.2E+00
-7.3E+00  -6.2E-01  -1.6E+01  -8.4E+00  -3.8E+00
-1.8E+01  -1.0E+01  -5.1E+00  -3.7E+00  -1.2E+01
-4.2E+00   1.0E+00   2.6E+00  -4.2E-01   1.4E+00
 1.0E+01   1.6E+01   1.7E+01   1.4E+01   5.5E+00
Intercept : 4.0E+02
All 55 features are considered..! Too much complicated…

Variables (the 55 polynomial features, 5 per row, in the same order as the coefficients above) :
x0, x1, x2, x0^2, x0 x1
x0 x2, x1^2, x1 x2, x2^2, x0^3
x0^2 x1, x0^2 x2, x0 x1^2, x0 x1 x2, x0 x2^2
x1^3, x1^2 x2, x1 x2^2, x2^3, x0^4
x0^3 x1, x0^3 x2, x0^2 x1^2, x0^2 x1 x2, x0^2 x2^2
x0 x1^3, x0 x1^2 x2, x0 x1 x2^2, x0 x2^3, x1^4
x1^3 x2, x1^2 x2^2, x1 x2^3, x2^4, x0^5
x0^4 x1, x0^4 x2, x0^3 x1^2, x0^3 x1 x2, x0^3 x2^2
x0^2 x1^3, x0^2 x1^2 x2, x0^2 x1 x2^2, x0^2 x2^3, x0 x1^4
x0 x1^3 x2, x0 x1^2 x2^2, x0 x1 x2^3, x0 x2^4, x1^5
x1^4 x2, x1^3 x2^2, x1^2 x2^3, x1 x2^4, x2^5
Summarize the results (con’t)
Lasso Regression :
Coefficients (same feature order as above; 0 = feature dropped by L1 regularization) :
0         0         0         6.0E+01   0
0         5.3E+01   0         0         2.2E+01
0         0         0         0         0
0         1.1E+02   0         0         0
0         0         0         0         0
0         0         0         0         0
0         5.1E+01   2.1E+01   0         0
0         0         0         0         0
0         0         0         0         0
0         0         0         0         0
0         0         0         3.1E+01   0
Intercept : 4.0E+02

The 7 features with non-zero coefficients (x0^2, x1^2, x0^3, x1^2 x2, x1^2 x2^2, x1 x2^3, x1 x2^4) are considered the most influential or informative for the predictive model..!
Take-home points (THPs)
-
-
-
…
