
INDIAN INSTITUTE OF TECHNOLOGY ROORKEE

Machine Learning
CSN-382 (Lecture 3)
Dr. R. Balasubramanian
Professor
Department of Computer Science and Engineering
Mehta Family School of Data Science and Artificial Intelligence
Indian Institute of Technology Roorkee
Roorkee 247 667
[email protected]
https://faculty.iitr.ac.in/cs/bala/
Limitations of ML

There are some limitations of ML, and scope for improvement as well.

● Related to data
○ Lack of suitable data & human bias in the data
○ Data privacy and ethical issues
○ Rapid changes in the data
● Related to models
○ Biased models
○ Poor performance in production
○ Regular training required
○ Black box models
● Related to infrastructure
○ Expensive infrastructure requirement

Limitations of ML

How reliable are the models we train?

[1] https://bgr.com/2018/06/22/uber-self-driving-car-crash-arizona-hulu-logs/
[2] https://www.mirror.co.uk/news/world-news/google-self-driving-car-hits-7529261
Regression

● Regression generally means “stepping back towards the average”.
● Regression analysis is also defined as the measure of the average relationship between two or more variables.
● The predicted variable is known as the dependent/target/response variable.
● The variable(s) used for prediction are known as independent/explanatory/regressor variables.

Terminologies Related to Regression Analysis

► Dependent Variable
► Independent Variable
► Outliers
► Multicollinearity
► Underfitting and Overfitting

Types of Regression

What is linear regression?

● Linear regression is a supervised learning algorithm used to identify a relationship between two or more variables.
● This relationship can be used to predict values for one variable when the value(s) of the other variable(s) are given.
● A simple linear regression model (also called bivariate regression) has one independent variable X that has a linear relationship with the dependent variable Y:
y = β0 + β1x + ε
Here β0 and β1 are the parameters of the linear regression model.
Regression: Use case

● Let us consider the impact of a single variable for now.

Age (independent variable) → Blood Pressure (target variable)

We assume that only Age decides what the Blood Pressure of a person should be.

Regression: Use case

Let us consider this data:
● The task is to predict the blood pressure when the age is 40.

Age in years (x)   Blood pressure (y)
25                 120
36                 135
68                 143
55                 139
49                 120
72                 165
40                 ?
Linear Regression line

y = β0 + β1x + ε

y = set of values taken by the dependent variable Y
x = set of values taken by the independent variable X
β0 = y-intercept
β1 = slope
ε = random error component

Linear Regression line

In the context of our example:

Blood Pressure = β0 + β1·Age + ε

y = set of values taken by the dependent variable, blood pressure
x = set of values taken by the independent variable, age
β0 = blood pressure value where the best fit line cuts the y-axis
β1 = beta coefficient for age
ε = random error component

What is the error term?

● The error term, also called the residual, represents the distance of the observed value from the value predicted by the regression line.
● In our example,
Error term = Actual blood pressure − Predicted blood pressure
for each observation

Calculating the error term
● The equation of the regression line is given by: y = β0 + β1x + ε

● The error term can be calculated as: ε = y − (β0 + β1x)

● We have an error term for every observation in the data:
εi = yactual − ypredicted

● Sum of squared errors = ∑ εi²
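As a minimal illustration of these definitions in Python (the β0 and β1 values below are hypothetical placeholders, not fitted estimates):

# Sketch: residuals and sum of squared errors for given parameter values.
ages = [25, 36, 68, 55, 49, 72]         # x: age (independent variable)
bp   = [120, 135, 143, 139, 120, 165]   # y: observed blood pressure

beta0, beta1 = 100.0, 0.7               # hypothetical values, for illustration only

predicted = [beta0 + beta1 * x for x in ages]
residuals = [y - y_hat for y, y_hat in zip(bp, predicted)]  # εi = y_actual − y_predicted
sse = sum(e ** 2 for e in residuals)                        # ∑ εi²
print(residuals, sse)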


Which line best fits our data?

● The regression line which best explains the trend in the data is the best fit line
● The line with the least error will be chosen as the best fitting line

Methods to get the best fit line

● Two methods can be used to find the best fit line:
○ Ordinary least squares (OLS)
○ Gradient descent

Linear Regression Model

Relationship between variables is a linear function:

Yi = β0 + β1Xi + εi

Yi = dependent (response) variable
Xi = independent (explanatory) variable
β0 = population y-intercept
β1 = population slope
εi = random error
OLS

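The OLS derivation on these slides did not survive extraction as text. As a sketch, the standard closed-form OLS estimates for simple linear regression are β1 = Σ(xi − x̄)(yi − ȳ) / Σ(xi − x̄)² and β0 = ȳ − β1x̄. A minimal Python illustration on the blood-pressure data from earlier:

# Sketch: closed-form OLS estimates for simple linear regression.
ages = [25, 36, 68, 55, 49, 72]
bp   = [120, 135, 143, 139, 120, 165]

n = len(ages)
x_bar = sum(ages) / n
y_bar = sum(bp) / n

s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(ages, bp))
s_xx = sum((x - x_bar) ** 2 for x in ages)

beta1 = s_xy / s_xx            # slope
beta0 = y_bar - beta1 * x_bar  # intercept

print(beta0, beta1)
print(beta0 + beta1 * 40)      # predicted blood pressure at age 40 (roughly 129)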
Measures of variation

Sum of squared error (SSE): Σ(yi − ŷi)²
Sum of squared total (SST): Σ(yi − ȳ)²
Sum of squared regression (SSR): Σ(ŷi − ȳ)²

yi = observed values of y
ŷi = predicted values of y
ȳ = mean value of observed values of y
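These three quantities are related by SST = SSR + SSE, and the ratio R² = SSR/SST summarizes how much of the total variation the regression explains. A minimal self-contained sketch on the blood-pressure data:

# Sketch: SSE, SST, SSR and R² for the OLS-fitted line.
ages = [25, 36, 68, 55, 49, 72]
bp   = [120, 135, 143, 139, 120, 165]

n = len(ages)
x_bar = sum(ages) / n
y_bar = sum(bp) / n
beta1 = (sum((x - x_bar) * (y - y_bar) for x, y in zip(ages, bp))
         / sum((x - x_bar) ** 2 for x in ages))
beta0 = y_bar - beta1 * x_bar

y_hat = [beta0 + beta1 * x for x in ages]
sse = sum((y - yh) ** 2 for y, yh in zip(bp, y_hat))   # Σ(yi − ŷi)²
sst = sum((y - y_bar) ** 2 for y in bp)                # Σ(yi − ȳ)²
ssr = sum((yh - y_bar) ** 2 for yh in y_hat)           # Σ(ŷi − ȳ)²

r_squared = ssr / sst   # equivalently 1 − sse/sst, since SST = SSR + SSE
print(sse, sst, ssr, r_squared)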

Gradient descent

● Using the OLS method, we get the estimates of the parameters of the linear regression model by minimizing the sum of squared errors.
● Gradient descent is an optimization technique which finds the β parameters such that the error term is minimum.
● For data of higher dimension, obtaining the parameters with the OLS method is computationally expensive, whereas gradient descent is faster.

Gradient descent

● An error function, also known as a loss function, is used to calculate the cost associated with the deviation of the observed data from the predicted data.
● It is an iterative method which converges to the optimum solution.
● The estimates of the parameters are updated at every iteration.

[Figure: loss function plotted against the beta coefficient to estimate, showing an initial approximation, the gradient, and the cost minimum.]

Gradient descent
● Consider a ball rolling down a slope
● Any position on the slope corresponds to the loss (cost) for the current values of the coefficients
● The bottom of the slope is where the cost function is minimum
● The objective is to find the lowest point of the cost function by continuously trying different values of the parameters
● Repeating this process numerous times, we arrive at the parameters for which the cost is minimum

Gradient Descent Algorithm

► Gradient descent is an algorithm that finds the best-fit line for a given training dataset in a relatively small number of iterations.
► If we plot m and c against MSE, the cost surface acquires a bowl shape.

► For some combination of m and c, we will get the least error (MSE). That combination of m and c will give us our best fit line.
► The algorithm starts with some values of m and c (usually m = 0, c = 0). We calculate the MSE (cost) at the point m = 0, c = 0. Let's say the MSE (cost) at m = 0, c = 0 is 100.
► Then we adjust m and c by a small amount (the learning step) in the direction that reduces the cost. We will notice a decrease in the MSE (cost).
► We continue doing the same until our loss function is a very small value or ideally 0 (which means 0 error, or 100% accuracy).

Algorithm

1. Let m = 0 and c = 0. Let L be our learning rate; it could be a small value like 0.01 for good accuracy.
(The learning rate controls how far the parameters move at each step of gradient descent. Setting it too high makes the path unstable; setting it too low makes convergence slow. Setting it to zero means the model isn't learning anything from the gradients.)

2. (a) Calculate the partial derivative of the cost function with respect to m. Let this partial derivative be Dm (with a little change in m, how much the cost function changes).

Step 2(a)

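The derivative shown on this slide did not extract as text. As a sketch, assuming the cost function is the mean squared error E = (1/n) ∑ (yi − (m·xi + c))², the standard result is:

Dm = ∂E/∂m = −(2/n) ∑ xi (yi − (m·xi + c))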
Step 2(b)

2. (b) Similarly, let's find the partial derivative with respect to c. Let the partial derivative of the cost function with respect to c be Dc (with a little change in c, how much the cost function changes).
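(As with Dm, the slide's formula did not extract; for the same MSE cost the standard result is Dc = ∂E/∂c = −(2/n) ∑ (yi − (m·xi + c)).)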
3. Now update the current values of m and c using the following equations:
m = m − L·Dm
c = c − L·Dc
4. We repeat this process until our cost function is very small (ideally 0).
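Putting steps 1–4 together, here is a minimal runnable sketch in Python (the toy data, learning rate, and iteration count are illustrative assumptions, not values from the slides):

# Sketch: gradient descent for simple linear regression (y ≈ m·x + c).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.0, 4.0, 6.0, 8.0]   # toy data lying exactly on y = 2x

m, c = 0.0, 0.0   # step 1: start at m = 0, c = 0
L = 0.01          # learning rate
n = len(xs)

for _ in range(10000):
    y_hat = [m * x + c for x in xs]
    # step 2: partial derivatives of the MSE cost
    d_m = (-2 / n) * sum(x * (y - yh) for x, y, yh in zip(xs, ys, y_hat))
    d_c = (-2 / n) * sum(y - yh for y, yh in zip(ys, y_hat))
    # step 3: update the parameters
    m -= L * d_m
    c -= L * d_c

# step 4: after enough iterations, m ≈ 2 and c ≈ 0 for this toy data
print(m, c)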

Problem

► Perform linear regression using the OLS and gradient descent methods.

x    y
2    3
4    7
6    5
8   10
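A worked sketch for the OLS part: with x̄ = 5 and ȳ = 6.25, the closed-form estimates give β1 = 19/20 = 0.95 and β0 = 6.25 − 0.95·5 = 1.5, so the fitted line is ŷ = 1.5 + 0.95x. Gradient descent with a small learning rate (as in the earlier sketch) should converge to approximately the same values.

# Sketch: solving the practice problem with closed-form OLS.
xs = [2, 4, 6, 8]
ys = [3, 7, 5, 10]

n = len(xs)
x_bar = sum(xs) / n   # 5.0
y_bar = sum(ys) / n   # 6.25

beta1 = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
         / sum((x - x_bar) ** 2 for x in xs))   # 19/20 = 0.95
beta0 = y_bar - beta1 * x_bar                   # 6.25 − 4.75 = 1.5

print(f"y = {beta0} + {beta1}x")                # y = 1.5 + 0.95x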

Thank You!
