Simple_and_Multiple_Regression

Uploaded by Vipin Gautam

In [1]:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

In [2]:
#Reading the dataset
#dataset = pd.read_csv("https://raw.githubusercontent.com/Satyajeet-IITDelhi/sales/main/SLRSales.csv")

In [3]:
#Reading the dataset
dataset = pd.read_csv("C:/NeuralNetwork/MRMSL861/SLRSales.csv")

In [4]:
dataset.head()

Out[4]:
   Sales  Adv_Exp
0   43.6     13.9
1   38.0     12.0
2   30.1      9.3
3   35.3      9.7
4   46.4     12.3

In [5]:
#Model Building
#Simple Linear Regression
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn import metrics

In [6]:
#Setting the value for X and Y
x = dataset[['Adv_Exp']]
y = dataset['Sales']
In [7]:
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size = 0.3, random_state = 100)

In [8]:
slr= LinearRegression()
slr.fit(x_train, y_train)

Out[8]: LinearRegression()

In [9]:
#Printing the model coefficients
print('Intercept: ', slr.intercept_)
print('Coefficient:', slr.coef_)

Intercept: 14.462716405605931
Coefficient: [2.08367683]

In [10]:
print('Regression Equation: Sales = 14.46 + 2.08 * Adv_Exp')

Regression Equation: Sales = 14.46 + 2.08 * Adv_Exp
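
With the model fitted, `slr.predict` gives point predictions for new advertising spends. A self-contained sketch on synthetic data (the coefficients 14.46 and 2.08 below come from the fitted equation above; the individual data points are hypothetical):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Hypothetical data generated around the fitted equation Sales = 14.46 + 2.08 * Adv_Exp
rng = np.random.default_rng(0)
adv = rng.uniform(8, 15, size=(30, 1))                            # advertising spend
sales = 14.46 + 2.08 * adv.ravel() + rng.normal(0, 1.0, size=30)  # noisy sales

slr = LinearRegression().fit(adv, sales)
pred = slr.predict([[10.0]])  # predicted Sales for Adv_Exp = 10
print(pred)
```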

In [11]:
import statsmodels.api as sm

In [12]:
#fit linear regression model
model = sm.OLS(y, x).fit()

In [13]:
#view model summary
print(model.summary())

OLS Regression Results


=======================================================================================
Dep. Variable: Sales R-squared (uncentered): 0.990
Model: OLS Adj. R-squared (uncentered): 0.990
Method: Least Squares F-statistic: 1140.
Date: Thu, 06 Jul 2023 Prob (F-statistic): 1.84e-12
Time: 16:19:36 Log-Likelihood: -32.310
No. Observations: 12 AIC: 66.62
Df Residuals: 11 BIC: 67.11
Df Model: 1
Covariance Type: nonrobust
==============================================================================
coef std err t P>|t| [0.025 0.975]
------------------------------------------------------------------------------
Adv_Exp 3.2395 0.096 33.762 0.000 3.028 3.451
==============================================================================
Omnibus: 0.341 Durbin-Watson: 2.699
Prob(Omnibus): 0.843 Jarque-Bera (JB): 0.445
Skew: 0.288 Prob(JB): 0.801
Kurtosis: 2.253 Cond. No. 1.00
==============================================================================

Notes:
[1] R² is computed without centering (uncentered) since the model does not contain a constant.
[2] Standard Errors assume that the covariance matrix of the errors is correctly specified.
C:\Users\Satyajeet\anaconda3\lib\site-packages\scipy\stats\_stats_py.py:1736: UserWarning: kurtosistest only valid for n>=20 ... continuing anyway, n=12
  warnings.warn("kurtosistest only valid for n>=20 ... continuing ")

Multiple Linear Regression (MLR)


In [14]:
#Importing the libraries
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

In [15]:
#Reading the dataset
dataset = pd.read_csv("https://raw.githubusercontent.com/Harshita0109/Sales-Prediction/master/advertising.csv")

In [16]:
dataset.head()

Out[16]:
      TV  Radio  Newspaper  Sales
0  230.1   37.8       69.2   22.1
1   44.5   39.3       45.1   10.4
2   17.2   45.9       69.3   12.0
3  151.5   41.3       58.5   16.5
4  180.8   10.8       58.4   17.9

In [17]:
#Exploratory Data Analysis
#Distribution of the target variable
sns.histplot(dataset['Sales'], kde=True);  # histplot replaces the deprecated distplot

In [18]:
#Exploratory Data Analysis
#Distribution of the independent variable (IV)
sns.histplot(dataset['TV'], kde=True);

In [19]:
#Exploratory Data Analysis
#Distribution of the independent variable (IV)
sns.histplot(dataset['Radio'], kde=True);

In [20]:
#Exploratory Data Analysis
#Distribution of the independent variable (IV)
sns.histplot(dataset['Newspaper'], kde=True);

In [21]:
#Heatmap
sns.heatmap(dataset.corr(), annot = True)
plt.show()
In [22]:
#Multiple Linear Regression(MLR)
#Equation: Sales = β0 + (β1 * TV) + (β2 * Radio) + (β3 * Newspaper)
#Setting the value for X and Y
x = dataset[['TV', 'Radio', 'Newspaper']]
y = dataset['Sales']

In [23]:
x_train, x_test, y_train, y_test= train_test_split(x, y, test_size= 0.3, random_state=100)

In [24]:
mlr= LinearRegression()
mlr.fit(x_train, y_train)

Out[24]: LinearRegression()

In [25]:
#Printing the model coefficients
print(mlr.intercept_)
# pair the feature names with the coefficients
list(zip(x, mlr.coef_))

4.334595861728431
Out[25]: [('TV', 0.053829108667250075),
('Radio', 0.11001224388558056),
('Newspaper', 0.006289950146130346)]
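
The `metrics` module imported earlier has not been used yet; evaluating the fitted model on the held-out test set closes the loop. A self-contained sketch on synthetic data shaped like the advertising set (the generating coefficients are hypothetical):

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn import metrics

# Hypothetical data: 200 rows of three ad channels driving sales
rng = np.random.default_rng(3)
X = rng.uniform(0, 100, size=(200, 3))
y = 4.3 + X @ np.array([0.054, 0.110, 0.006]) + rng.normal(0, 0.5, 200)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=100)
mlr = LinearRegression().fit(X_train, y_train)
y_pred = mlr.predict(X_test)

print('R2  :', metrics.r2_score(y_test, y_pred))
print('MAE :', metrics.mean_absolute_error(y_test, y_pred))
print('RMSE:', np.sqrt(metrics.mean_squared_error(y_test, y_pred)))
```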
In [26]:
import statsmodels.api as sm

In [27]:
#fit linear regression model
model = sm.OLS(y, x).fit()

In [28]:
#view model summary
print(model.summary())

OLS Regression Results


=======================================================================================
Dep. Variable: Sales R-squared (uncentered): 0.977
Model: OLS Adj. R-squared (uncentered): 0.977
Method: Least Squares F-statistic: 2826.
Date: Thu, 06 Jul 2023 Prob (F-statistic): 1.35e-161
Time: 16:22:13 Log-Likelihood: -460.08
No. Observations: 200 AIC: 926.2
Df Residuals: 197 BIC: 936.1
Df Model: 3
Covariance Type: nonrobust
==============================================================================
coef std err t P>|t| [0.025 0.975]
------------------------------------------------------------------------------
TV 0.0671 0.002 42.078 0.000 0.064 0.070
Radio 0.1600 0.011 14.154 0.000 0.138 0.182
Newspaper 0.0284 0.008 3.545 0.000 0.013 0.044
==============================================================================
Omnibus: 0.114 Durbin-Watson: 1.949
Prob(Omnibus): 0.945 Jarque-Bera (JB): 0.025
Skew: 0.026 Prob(JB): 0.987
Kurtosis: 3.020 Cond. No. 12.6
==============================================================================

Notes:
[1] R² is computed without centering (uncentered) since the model does not contain a constant.
[2] Standard Errors assume that the covariance matrix of the errors is correctly specified.
