100% found this document useful (1 vote)

85 views

Linear Regression With Gradient Descent

This document discusses using gradient descent for linear regression. It defines the linear regression equation and loss function, and shows how to: 1) Initialize parameters for the linear regression model and split data into training and test sets. 2) Calculate the gradients of the loss function with respect to the model parameters. 3) Define an update rule to update the parameters using gradient descent. 4) Implement a gradient descent function to iteratively update the parameters, calculate loss, and return the optimized parameters.

Uploaded by

Gautam Kumar

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

85 views

Linear Regression With Gradient Descent

Uploaded by

Gautam Kumar

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

13/06/2021 Linear_Reg_Gradient_Descent

Linear Reg with Gradient Descent

Now that we have understood how the gradient descent algorithm ﬁnds the optimal parameters of the model, in
this section, we will understand how can we use gradient descent in linear regression and ﬁnd the optimal
parameter.

The equation of a simple linear regression can be expressed as:

𝑦̂ = 𝑚𝑥 + 𝑏(1)

Thus, we have two parameters 𝑚 and 𝑏 . We will see how can we use gradient descent and ﬁnd the optimal
values for these two parameters 𝑚 and 𝑏 .

In [1]: # Import libraries

import pandas as pd
import numpy as np
from scipy.stats import pearsonr
import matplotlib.pyplot as plt

Data Preparation
References https://medium.com/data-science-365/linear-regression-with-gradient-descent-895bb7d18d52
(https://medium.com/data-science-365/linear-regression-with-gradient-descent-895bb7d18d52)

In [2]: df = pd.read_csv('Advertising.csv')
df.head()

Out[2]:
TV Radio Newspaper Sales

0 230.1 37.8 69.2 22.1

1 44.5 39.3 45.1 10.4

2 17.2 45.9 69.3 9.3

3 151.5 41.3 58.5 18.5

4 180.8 10.8 58.4 12.9

The Advertising dataset captures sales revenue generated with respect to advertisement spends across multiple
channels like radio, TV and newspaper. As you can see, there are four columns in the dataset. Since our
problem deﬁnition involves only sales and TV columns in the dataset, we do not need radio and newspaper
columns.

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 1/8
13/06/2021 Linear_Reg_Gradient_Descent

In [3]: # drop unwanted columns

df.drop(columns=['Radio','Newspaper'],inplace=True)
df.head()

Out[3]:
TV Sales

0 230.1 22.1

1 44.5 10.4

2 17.2 9.3

3 151.5 18.5

4 180.8 12.9

In [4]: df.isnull().sum()

Out[4]: TV 0
Sales 0
dtype: int64

Parameter Initialization
We know that equation of a simple linear regression is expressed as:
𝑦̂ = 𝑚𝑥 + 𝑏

Thus, we have two parameters 𝑚 and 𝑏 . We store both of these parameter 𝑚 and 𝑏 in an array called theta.
First, we initialize theta with zeros

In [5]: # define X, Y , Theta

#---------------------------
N = df['Sales'].values.size # number of observations
x = np.append(np.ones((N,1)),df['TV'].values.reshape(N,1),axis=1)
y = df['Sales'].values.reshape(N,1)
theta = np.zeros((2,1))

In [6]: # test train split

from sklearn.model_selection import train_test_split

# split into train test sets

X_train, X_test, y_train, y_test = train_test_split(x, y, test_size=0.33
)

print(X_train.shape, X_test.shape, y_train.shape, y_test.shape)

(134, 2) (66, 2) (134, 1) (66, 1)

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 2/8
13/06/2021 Linear_Reg_Gradient_Descent

Loss function
Mean Squared Error (MSE) of Regression is given as:
𝑁
1 2
𝐽 = ̂
(𝑦 − 𝑦) − −(2)
2𝑁 ∑
𝑖=1

Where 𝑁 is the number of training samples, 𝑦 is the actual value and 𝑦̂ is the predicted value.

The above loss function can be implemented as:

We feed the data and the model parameter theta to the loss function which returns the MSE. Remember,
data[,0] has 𝑥 value and data[,1] has 𝑦 value. Similarly, theta [0] has a value of 𝑚 and theta[1] has a value of 𝑏 .

In [7]: # define loss function

def mse_loss(x,y,theta):
y_pred = np.dot(x,theta)
sqrd_err = (y - y_pred)**2
loss = (1/(2*N))*np.sum(sqrd_err)
return loss

Now, we need to minimize this loss. In order to minimize the loss, we need to calculate the gradient of the loss
function 𝐽 with respect to the model parameters 𝑚 and 𝑏 and update the parameter according to the parameter
update rule. So, ﬁrst, we will calculate the gradient of the loss function.

Gradients of Loss Function

Gradients of loss function 𝐽 with respect to parameter 𝑚 is given as:
𝑁
𝑑𝐽 2
= −𝑥𝑖 (𝑦𝑖 − (𝑚𝑥𝑖 + 𝑏)) − −(3)
𝑑𝑚 𝑁 ∑
𝑖=1

Gradients of loss function 𝐽 with respect to parameter 𝑏 is given as:

𝑁
𝑑𝐽 2
= − (𝑦𝑖 − (𝑚𝑥𝑖 + 𝑏)) − −(4)
𝑑𝑏 𝑁 ∑
𝑖=1

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 3/8
13/06/2021 Linear_Reg_Gradient_Descent

Update Rule
After computing gradients we need to update our model paramater according to our update rule as given
below:
𝑑𝐽
𝑚 = 𝑚 − 𝛼 − −(5)
𝑑𝑚

𝑑𝐽
𝑏 = 𝑏 − 𝛼 − −(6)
𝑑𝑏

Since we stored 𝑚 in theta[0] and 𝑏 in theta[1], we can write our update equation as:
𝑑𝐽
𝜃 = 𝜃 − 𝛼 − −(7)
𝑑𝜃

As we learned in the previous section, updating gradients for just one time will not lead us to the convergence
i.e minimum of the cost function, so we need to compute gradients and the update the model parameter for
several iterations:

Deﬁning the Gradient Descent function

In [8]: def Gradient_Desc(x,y,learn_rate=0.000008,num_iter=400000):

losses = []
theta = np.zeros((2,1))
for i in range(num_iter):
y_pred = np.dot(x,theta)
loss = mse_loss(x,y,theta)
der = np.dot(x.transpose(),(y_pred-y))/N
theta -= learn_rate*der
#print(theta)
losses.append(loss)

return theta, losses,learn_rate

MODEL TRAIN

In [9]: # Run the gradient descent on train data

theta, losses,alpha = Gradient_Desc(X_train,y_train)
print(theta)

[[3.10205147]
[0.06843237]]

Plot Loss curve

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 4/8
13/06/2021 Linear_Reg_Gradient_Descent

In [10]: def plot_loss(losses,alpha):

fig, ax = plt.subplots(figsize = (9,6))
ax.plot(losses)
ax.set_title("Grad Descent loss values Vs iterations",pad=20,size=18
,color='gray')
ax.set_ylabel("losses for Learning rate =" + str(alpha))
ax.set_xlabel("Iterations")

In [11]: plot_loss(losses,alpha)

Predict and Accuracy check

In [12]: # Predict function

def predict(x,theta):
y_pred = np.dot(x,np.round(theta,3))
return y_pred

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 5/8
13/06/2021 Linear_Reg_Gradient_Descent

In [13]: # Accuracy function

def get_accuracy(y_pred,y):
y_mean = y.mean()
# Coeff of determination - R squared : Pearson correlation is chosen
between y and y_pred
corrmat = np.corrcoef(y.flatten(),y_pred.flatten())
pearson_corr = corrmat[0,1]
R_sq = pearson_corr**2
# Mean Square Error
MSE = ((y - y_pred)**2).mean()
# Root Mean Square Error
RMSE = np.sqrt(MSE)
accu_dict = {"R_sq":[np.round(R_sq,2)],"RMSE":[np.round(RMSE,2)]}
return accu_dict

In [14]: def print_accuracy(accu_dict):

accu_df = pd.DataFrame(accu_dict)
pd.set_option('display.max_rows', None)
pd.set_option('display.max_columns', None)
pd.set_option('display.width', 1000)
pd.set_option('display.colheader_justify', 'center')
pd.set_option('display.precision', 2)
display(accu_df)

MODEL TEST

Train Accuracy

In [15]: # predict on train data

y_pred_train = predict(X_train,theta)
# get accuracy
accu_dict_train = get_accuracy(y_pred_train,y_train)
df1 = pd.DataFrame.from_dict(accu_dict_train)
display(df1)

R_sq RMSE

0 0.63 3.72

Test Accuracy

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 6/8
13/06/2021 Linear_Reg_Gradient_Descent

In [16]: # predict on test data

y_pred_test = predict(X_test,theta)
# get accuracy
accu_dict_test = get_accuracy(y_pred_test,y_test)
df2 = pd.DataFrame.from_dict(accu_dict_test)
display(df2)

R_sq RMSE

0 0.58 3.96

PLOT Regression line

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 7/8
13/06/2021 Linear_Reg_Gradient_Descent

In [39]: ### PLOT Regression Line

%matplotlib inline
import seaborn as sns
import matplotlib.pyplot as plt

# get data for plot

xdata = X_test[:,1].flatten()
ydata = y_test.flatten()
y_pred = y_pred_test.flatten()

plt.style.use('ggplot')
fig,ax = plt.subplots(figsize = (9,6))
# plot scatter of raw data
sns.scatterplot(x=xdata,y=ydata,ax=ax,color='red')

#plot regression line of predicted values

# create regression equation string

theta0 = theta[0]
theta1 = theta[1]
reg_eqn = "y = " + str(theta0) + " + " + str(theta1)+"*X"
sns.lineplot(xdata,y_pred,ax=ax,label = reg_eqn,color='blue')

ax.set_title("Linear Regression fit with Gradient Descent")

ax.set_ylabel("Sales in 1000$")
ax.set_xlabel("TV advertising spend in $")
ax.legend(loc="upper left")

Out[39]: <matplotlib.legend.Legend at 0x7fcbca1aa0a0>

In [ ]:

ﬁle:///Users/anishroychowdhury/Downloads/Linear_Reg_Gradient_Descent.html 8/8

Rubric For Simulation Activity
50% (2)
Rubric For Simulation Activity
3 pages
IE317 FALL2022 Homework 1 Solutions
No ratings yet
IE317 FALL2022 Homework 1 Solutions
3 pages
Docs Slides Lecture11
No ratings yet
Docs Slides Lecture11
18 pages
Machine Learning Cheat Sheet
100% (1)
Machine Learning Cheat Sheet
211 pages
Building Powerful Predictive Scorecards 1991WP
No ratings yet
Building Powerful Predictive Scorecards 1991WP
46 pages
QuantEconlectures Python3 PDF
100% (1)
QuantEconlectures Python3 PDF
1,125 pages
Sequence of Asanas
No ratings yet
Sequence of Asanas
7 pages
ML Cheatsheet
No ratings yet
ML Cheatsheet
1 page
Machine Learning Andrew NG Week 6 Quiz 1
No ratings yet
Machine Learning Andrew NG Week 6 Quiz 1
8 pages
Machine Learning Coursera All Exercies PDF
No ratings yet
Machine Learning Coursera All Exercies PDF
117 pages
Slide 3 - Linear Regression One Variable
No ratings yet
Slide 3 - Linear Regression One Variable
60 pages
Slide 4 - Linear Regression With Multiple Variables
100% (1)
Slide 4 - Linear Regression With Multiple Variables
30 pages
Machine Learning Andrew NG Week 5 Quiz 1
No ratings yet
Machine Learning Andrew NG Week 5 Quiz 1
3 pages
Regularization: The Problem of Overfitting
No ratings yet
Regularization: The Problem of Overfitting
24 pages
Optimization Convex Concave Example
No ratings yet
Optimization Convex Concave Example
5 pages
Unit Test (Practice) - Kinematics (Oct 2008 Version) Solutions
No ratings yet
Unit Test (Practice) - Kinematics (Oct 2008 Version) Solutions
6 pages
Model Perf Cheat Sheet
No ratings yet
Model Perf Cheat Sheet
2 pages
Logistic Regression
100% (1)
Logistic Regression
21 pages
Poly
100% (1)
Poly
108 pages
Lead Scoring Group Case Study Presentation
100% (2)
Lead Scoring Group Case Study Presentation
19 pages
Churn Modeling
100% (1)
Churn Modeling
11 pages
Data Science With R
100% (1)
Data Science With R
6 pages
1694600777-Unit2.2 Logistic Regression CU 2.0
100% (1)
1694600777-Unit2.2 Logistic Regression CU 2.0
37 pages
Unit 4 - Linear Regression
No ratings yet
Unit 4 - Linear Regression
52 pages
Midterm Solutions Machine
100% (1)
Midterm Solutions Machine
17 pages
Regression
No ratings yet
Regression
46 pages
Sph3u Examreview
No ratings yet
Sph3u Examreview
2 pages
Regression Analysis
100% (2)
Regression Analysis
9 pages
Machine Learning Advanced
100% (2)
Machine Learning Advanced
12 pages
LDA 01 Linear Discriminant Analysis
No ratings yet
LDA 01 Linear Discriminant Analysis
65 pages
Customer Segmentation Clustering
No ratings yet
Customer Segmentation Clustering
35 pages
Predictive Model For E-Commerce
100% (1)
Predictive Model For E-Commerce
3 pages
Optimization Based On Gradient Descent
No ratings yet
Optimization Based On Gradient Descent
24 pages
Cheatsheet Midterms 2 - 3
No ratings yet
Cheatsheet Midterms 2 - 3
2 pages
HW1
100% (1)
HW1
8 pages
Deployment: Cheat Sheet: Machine Learning With KNIME Analytics Platform
No ratings yet
Deployment: Cheat Sheet: Machine Learning With KNIME Analytics Platform
1 page
ML MU Unit 2
100% (3)
ML MU Unit 2
84 pages
Chapter 5.3-Mulitple Linear Regression
No ratings yet
Chapter 5.3-Mulitple Linear Regression
26 pages
### Data Exploration: 'Yes' 'No' 'Agency' 'Direct' 'Employee Referral' 'Yes' 'No'
100% (1)
### Data Exploration: 'Yes' 'No' 'Agency' 'Direct' 'Employee Referral' 'Yes' 'No'
6 pages
Generalized Additive Model
No ratings yet
Generalized Additive Model
10 pages
Maths For Machine Learning
No ratings yet
Maths For Machine Learning
47 pages
Logistic Regression: Gunjan Bharadwaj Assistant Professor Dept of CEA
100% (1)
Logistic Regression: Gunjan Bharadwaj Assistant Professor Dept of CEA
42 pages
Lead Scoring Case Study Presentation
100% (2)
Lead Scoring Case Study Presentation
11 pages
Classification and Regression Trees
100% (1)
Classification and Regression Trees
60 pages
The Box-Jenkins Methodology For RIMA Models
No ratings yet
The Box-Jenkins Methodology For RIMA Models
172 pages
Machine Learning Guide Line
No ratings yet
Machine Learning Guide Line
10 pages
Polynomial Regression and Step Function
100% (1)
Polynomial Regression and Step Function
6 pages
Logistic Regression Example
100% (1)
Logistic Regression Example
22 pages
01.multiple Linear Regression - Ipynb - Colaboratory
No ratings yet
01.multiple Linear Regression - Ipynb - Colaboratory
10 pages
Project 5 PDF
100% (1)
Project 5 PDF
48 pages
GAM: The Predictive Modeling Silver Bullet: Author: Kim Larsen
No ratings yet
GAM: The Predictive Modeling Silver Bullet: Author: Kim Larsen
27 pages
Homework 2
100% (1)
Homework 2
14 pages
Wine Case Report
100% (2)
Wine Case Report
16 pages
Weather Forecasting Basepaper
100% (1)
Weather Forecasting Basepaper
14 pages
Epilepsy - Definition, Classification, Pathophysiology, and Epidemiology
No ratings yet
Epilepsy - Definition, Classification, Pathophysiology, and Epidemiology
7 pages
Learning Path Machine Learning
No ratings yet
Learning Path Machine Learning
7 pages
Oil Export Indonesia
100% (1)
Oil Export Indonesia
12 pages
A Machine Learning Framework For Sport Result Prediction
No ratings yet
A Machine Learning Framework For Sport Result Prediction
7 pages
Lecture3_upload
No ratings yet
Lecture3_upload
28 pages
L02 Linear Regression
No ratings yet
L02 Linear Regression
9 pages
DeepLearning Lect2 3
No ratings yet
DeepLearning Lect2 3
89 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
The Complexity of Culture
No ratings yet
The Complexity of Culture
3 pages
Culture's Definition: N.E.Myridis
No ratings yet
Culture's Definition: N.E.Myridis
3 pages
Neural Networks MATH Explained
No ratings yet
Neural Networks MATH Explained
14 pages
FM 351 - Getting - Started
No ratings yet
FM 351 - Getting - Started
20 pages
Tactics and Fallacies of Illuminati
No ratings yet
Tactics and Fallacies of Illuminati
9 pages
Workout Thesis Statement
100% (3)
Workout Thesis Statement
8 pages
Future Board Game
No ratings yet
Future Board Game
2 pages
Environmental Education and Training
No ratings yet
Environmental Education and Training
47 pages
Mindray M9 SVM
No ratings yet
Mindray M9 SVM
271 pages
Gender Equality
No ratings yet
Gender Equality
6 pages
Michel Deza, Serguei Shpectorov and Mathieu Dutour-Sikiric - Wythoff Construction and l1 - Embedding
No ratings yet
Michel Deza, Serguei Shpectorov and Mathieu Dutour-Sikiric - Wythoff Construction and l1 - Embedding
53 pages
Kunzvi Dam Report
No ratings yet
Kunzvi Dam Report
7 pages
PHYS115 Sp24 Lecture05 ViscousFluids PRE
No ratings yet
PHYS115 Sp24 Lecture05 ViscousFluids PRE
22 pages
Ral Communication Mix
No ratings yet
Ral Communication Mix
26 pages
Em330 Em340 Et330 Et340 CP
No ratings yet
Em330 Em340 Et330 Et340 CP
19 pages
Anti-Theft Immobilizer
No ratings yet
Anti-Theft Immobilizer
43 pages
Stoneridge, Inc., Control Devices Division
No ratings yet
Stoneridge, Inc., Control Devices Division
4 pages
What Are The Guiding Questions?
No ratings yet
What Are The Guiding Questions?
2 pages
Ch-6 My childhood
No ratings yet
Ch-6 My childhood
2 pages
Publi 4156
No ratings yet
Publi 4156
170 pages
Furuno Installation Manual
71% (7)
Furuno Installation Manual
35 pages
Maximum and Minimum PDF
No ratings yet
Maximum and Minimum PDF
3 pages
Linac
No ratings yet
Linac
2 pages
SAP MM Procurement Type
No ratings yet
SAP MM Procurement Type
109 pages
NIA Project Registration and PEA Document
No ratings yet
NIA Project Registration and PEA Document
6 pages
Matlab Simulink Based
No ratings yet
Matlab Simulink Based
9 pages
The Revised Blooms Taxonomy ASSESSMENT
No ratings yet
The Revised Blooms Taxonomy ASSESSMENT
21 pages
PDF Detailed Lesson Plan Adjectives
No ratings yet
PDF Detailed Lesson Plan Adjectives
3 pages
Political Cleavages in Serbia - Changes and Continuities in Structuring Left-Right Orientations (2010)
No ratings yet
Political Cleavages in Serbia - Changes and Continuities in Structuring Left-Right Orientations (2010)
21 pages
Schneider Georgakis 2013 How To NOT Make The Extended Kalman Filter Fail
No ratings yet
Schneider Georgakis 2013 How To NOT Make The Extended Kalman Filter Fail
29 pages
Chicago Pneumatic CPS 250 Diesel Air Compressor Flyer
No ratings yet
Chicago Pneumatic CPS 250 Diesel Air Compressor Flyer
2 pages
PCS PRELIMS 2021 QUESTION PAPER
No ratings yet
PCS PRELIMS 2021 QUESTION PAPER
52 pages