Supply Chain Analytics

The document discusses predictive analytics and its applications in supply chain analytics, specifically focusing on linear regression techniques. It provides a case example involving a wine producing company analyzing the impact of advertising expenditures on sales, including the use of Excel and XLMiner for regression analysis. The document also addresses the importance of p-values and cautions against model misspecification and overfitting in regression analysis.

Uploaded by

aishwarya anand


18/01/2023

BUSINESS ANALYTICS

SUPPLY CHAIN ANALYTICS

MBA & MBAA TERM-VI
(2022-23)

SESSION 3
PREDICTIVE ANALYTICS & ITS
APPLICATIONS

Dr. Devendra Kumar Pathak

(M.Tech. & Ph.D., IIT Delhi)
Assistant Professor,
Operations Management & Decision Sciences,
Indian Institute of Management (IIM) Kashipur


MACHINE LEARNING OBJECTIVES & TECHNIQUES

LINEAR REGRESSION

• You own a ‘KBC’ wine-producing company that uses business analytics as its competitive advantage
• You would like to understand the effect of advertising expenditures on sales for one of your brands


LINEAR REGRESSION: CASE EXAMPLE

Region   First Year Sales ($ million)   First Year Advertising Expenditure ($ million)
A        101.8                          1.3
B         44.4                          0.7
C        108.3                          1.4
D         85.1                          0.5
E         77.1                          0.5
F        158.7                          1.9
G        180.4                          1.2
H         64.2                          0.4
I         74.6                          0.6
J        143.4                          1.3
K        120.6                          1.6
L         69.7                          1.0
M         67.8                          0.8
N        106.7                          0.6
O        119.6                          1.1

You are planning to start selling in a new region next year. What is your estimate of expected first year sales in this new region if you plan to spend $1.2M in advertising?

Average First Year Sales = $101.5M

SCATTER PLOT

[Figure: First Year Sales & Advertising Data. x-axis: First Year Advertising Expenditures ($ million), 0 to 2; y-axis: First Year Sales ($ million), 0 to 200. The second panel is annotated with Average First Year Sales = $101.5M.]

Linear Regression

• To model sales (Y) as a linear function of advertising expenditures (x), plus some random deviations (ε) [residual]:

Y = β0 + β1x + ε

  Here Y is the dependent variable and x is the predictor variable (IDV).

• We must estimate the unknown parameters β0 and β1.
• We will call these estimates b0 and b1, respectively.

How can we find the best line?

• How can we estimate the intercept (b0) and slope (b1)?

[Figure: First Year Sales & Advertising Data with a candidate line; the intercept b0 and slope b1 are marked. x-axis: First Year Advertising Expenditures ($ million), 0 to 2; y-axis: First Year Sales ($ million), 0 to 200.]

How can we find the best line?

• Regression estimate for Y at xi (prediction): ŷi = b0 + b1xi
• Residuals (“error” in prediction): ei = yi - ŷi
• Choose b0 and b1 to minimize the sum of squared residuals (or “errors”)

Use of Excel to solve it!

• Data > Data Analysis > Regression
• Select data cells for your Y (sales) data and x (advertising expenditures) data

Use of XLMiner to solve it!

• Data Mining > Prediction > Linear Regression
• Select data cells for your Y (sales) data and x (advertising expenditures) data
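The least-squares criterion described in this part of the deck has a closed-form solution for a single predictor. The sketch below is not the Excel or XLMiner procedure itself; it is a minimal plain-Python illustration that uses the 15-region table from the case example. The function name `fit_simple_ols` is a hypothetical helper introduced here.

```python
# First-year data from the case example table (both columns in $ million).
advertising = [1.3, 0.7, 1.4, 0.5, 0.5, 1.9, 1.2, 0.4, 0.6, 1.3, 1.6, 1.0, 0.8, 0.6, 1.1]
sales = [101.8, 44.4, 108.3, 85.1, 77.1, 158.7, 180.4, 64.2, 74.6, 143.4,
         120.6, 69.7, 67.8, 106.7, 119.6]


def fit_simple_ols(x, y):
    """Return (b0, b1) that minimize the sum of squared residuals."""
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Slope = co-variation of x and y divided by variation of x.
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = s_xy / s_xx
    # The fitted line always passes through the point of means.
    b0 = y_bar - b1 * x_bar
    return b0, b1


b0, b1 = fit_simple_ols(advertising, sales)
print(f"Sales = {b0:.1f} + {b1:.1f} * advertising expenditures")
```

Run on the table above, the coefficients should round to the 42.2 and 59.7 reported in the regression output.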

• Regression Output: Equation

Sales = 42.2 + 59.7 * advertising expenditures
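Plugging the planned $1.2M advertising spend into the fitted equation gives the regression answer to the new-region question. A minimal sketch, assuming the rounded coefficients from the regression output; `predict_sales` is a hypothetical helper name.

```python
def predict_sales(advertising_musd, b0=42.2, b1=59.7):
    """Predicted first-year sales ($ million) for a given advertising spend ($ million)."""
    return b0 + b1 * advertising_musd


estimate = predict_sales(1.2)
print(f"Estimated first year sales: ${estimate:.1f}M")
```

The result is noticeably higher than the naive $101.5M average, because the planned spend is above the average advertising level in the data.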

Best linear regression

[Figure: Line Fit Plot showing Y, Predicted Y, and Linear (Predicted Y) against the x Variable (0 to 2), with Y from 0 to 200. The first panel asks “Error ??”; the second panel marks a residual ei and the sum of squared residuals, Σei² = 10185.6.]

Sales (Y) = 42.2 + 59.7 * advertising expenditures (x)

NAÏVE ESTIMATION

[Figure: First Year Sales & Advertising Data with the naïve estimate (Average first year sales = $101.5M) and a residual ei marked. x-axis: First Year Advertising Expenditures ($ million), 0 to 2; y-axis: First Year Sales ($ million), 0 to 200.]

SSE naive = 20405

How good is our sales prediction?

R² is a measure of the overall quality of the regression. It is the proportion of the variance in the dependent variable that is predicted from the independent variable.
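R² can be computed by comparing the regression's sum of squared errors with that of the naive estimate (predicting the average for every region). The sketch below recomputes both quantities from the 15-region table; the variable names are illustrative and not taken from the slides.

```python
# First-year data from the case example table (both columns in $ million).
advertising = [1.3, 0.7, 1.4, 0.5, 0.5, 1.9, 1.2, 0.4, 0.6, 1.3, 1.6, 1.0, 0.8, 0.6, 1.1]
sales = [101.8, 44.4, 108.3, 85.1, 77.1, 158.7, 180.4, 64.2, 74.6, 143.4,
         120.6, 69.7, 67.8, 106.7, 119.6]

n = len(sales)
x_bar = sum(advertising) / n
y_bar = sum(sales) / n

# Least-squares slope and intercept.
s_xx = sum((x - x_bar) ** 2 for x in advertising)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(advertising, sales))
b1 = s_xy / s_xx
b0 = y_bar - b1 * x_bar

# Naive model: predict the average sales for every region.
sse_naive = sum((y - y_bar) ** 2 for y in sales)
# Regression model: predict b0 + b1 * x for every region.
sse_regression = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(advertising, sales))

# Share of the naive error that the regression removes.
r_squared = 1 - sse_regression / sse_naive
print(f"SSE naive = {sse_naive:.0f}, SSE regression = {sse_regression:.1f}, R^2 = {r_squared:.2f}")
```

With the slide values, this is 1 - 10185.6 / 20405, or about 0.50: advertising explains roughly half of the variation in first-year sales.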

How good are our b0 and b1 estimates?

Confidence Intervals

❖ Recall that b0 and b1 are estimates of β0 and β1
❖ Interpretation: Our estimate for β1 is b1 = 59.7; we are 95% confident that the true value of β1 lies between 24.0 and 95.4.

• What if we had more data? Can we do even better?

What is your estimate of expected first year sales in the new region if you plan to spend $1.2M in advertising, $0.3M in promotions, and the competitors’ sales are $20M?
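The 95% interval for β1 quoted on the Confidence Intervals slide can be reproduced from the 15-region table with the usual formula b1 ± t × SE(b1). This is a sketch under stated assumptions: the critical value 2.160 is taken from a t-table (two-sided 95%, n - 2 = 13 degrees of freedom) and hard-coded rather than computed, and the variable names are illustrative.

```python
import math

# First-year data from the case example table (both columns in $ million).
advertising = [1.3, 0.7, 1.4, 0.5, 0.5, 1.9, 1.2, 0.4, 0.6, 1.3, 1.6, 1.0, 0.8, 0.6, 1.1]
sales = [101.8, 44.4, 108.3, 85.1, 77.1, 158.7, 180.4, 64.2, 74.6, 143.4,
         120.6, 69.7, 67.8, 106.7, 119.6]

# Assumption: t-table value for a two-sided 95% interval with 13 degrees of freedom.
T_CRIT_95_DF13 = 2.160

n = len(sales)
x_bar = sum(advertising) / n
y_bar = sum(sales) / n
s_xx = sum((x - x_bar) ** 2 for x in advertising)
s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(advertising, sales))
b1 = s_xy / s_xx
b0 = y_bar - b1 * x_bar

# Residual variance uses n - 2 because two parameters were estimated.
sse = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(advertising, sales))
mse = sse / (n - 2)

# Standard error of the slope, then the interval around b1.
se_b1 = math.sqrt(mse / s_xx)
lower = b1 - T_CRIT_95_DF13 * se_b1
upper = b1 + T_CRIT_95_DF13 * se_b1
print(f"b1 = {b1:.1f}, 95% CI = ({lower:.1f}, {upper:.1f})")
```

The interval is wide because there are only 15 regions; more data would typically shrink SE(b1) and tighten it.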

Multiple Linear Regression

• Idea is to model sales (Y) as a linear function of multiple “features” (x1, x2, …, xk), plus some random deviations (ε):

Y = β0 + β1x1 + β2x2 + … + βkxk + ε

• We must estimate the unknown parameters β0, β1, β2, …, βk

Regression Output: Equation

Sales = 65.7 + 49.0 * advertising expenditures + 59.7 * promotions expenditures - 1.8 * competitors’ sales

R² Revisited

R² has increased from 0.50 to 0.83. In fact, R² never decreases when an additional feature is added, and it almost always increases. Does this imply that we should keep adding more features?
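The three-feature equation answers the earlier new-region question ($1.2M advertising, $0.3M promotions, $20M competitors' sales) by direct substitution. A minimal sketch using the coefficients exactly as printed in the regression output; the function and parameter names are hypothetical.

```python
def predict_sales_multi(advertising_musd, promotions_musd, competitor_sales_musd):
    """Predicted first-year sales ($ million) from the three-feature equation."""
    return (65.7
            + 49.0 * advertising_musd
            + 59.7 * promotions_musd
            - 1.8 * competitor_sales_musd)


estimate = predict_sales_multi(1.2, 0.3, 20.0)
print(f"Estimated first year sales: ${estimate:.1f}M")
```

The negative coefficient on competitors' sales pulls the estimate down relative to the single-feature prediction for the same advertising spend.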

Regression Output

Adding an additional feature: Average Annual Snowfall

R² has increased from 0.83 to 0.86.

Is average annual snowfall really a good predictor of sales?

Regression Output

• Our estimate for the impact of one more inch of snow on sales is $0.3M; we are 95% confident that the true value is between -$0.2M and $0.8M.
• Zero is in this confidence interval, so we cannot rule out that snow has NO effect on sales.
  - Associated with p-value > 0.05
  - p-values < 0.05 ➔ variable is significant in prediction
• Adding this feature has artificially inflated R²
  - Example of overfitting
• Variable selection: make sure no confidence intervals contain 0.

LINEAR REGRESSION: SIGNIFICANCE OF P-VALUE

How to Interpret the P-values in Linear Regression Analysis?

• The p-value for each term tests the null hypothesis that the coefficient is equal to zero (no effect).
• A low p-value (< 0.05) indicates that you can reject the null hypothesis.
• In other words, a predictor that has a low p-value is likely to be a meaningful addition to your model because changes in the predictor's value are related to changes in the response variable.
• Conversely, a larger (insignificant) p-value suggests that changes in the predictor are not associated with changes in the response variable.
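The variable-selection rule above (keep a feature only if its 95% confidence interval excludes 0) is easy to apply mechanically. The sketch below uses the two intervals quoted in the slides; note that the advertising interval comes from the single-feature model and the snowfall interval from the extended model, so pairing them here is purely illustrative. `is_significant` is a hypothetical helper name.

```python
def is_significant(ci_lower, ci_upper):
    """True when a 95% confidence interval excludes zero (p-value < 0.05)."""
    return not (ci_lower <= 0.0 <= ci_upper)


# 95% confidence intervals for the coefficients, as quoted in the slides.
intervals = {
    "advertising": (24.0, 95.4),
    "snowfall": (-0.2, 0.8),
}

kept = [name for name, (lower, upper) in intervals.items() if is_significant(lower, upper)]
dropped = [name for name in intervals if name not in kept]
print("keep:", kept)
print("drop:", dropped)
```

Checking whether the interval contains 0 is equivalent to comparing the coefficient's p-value with 0.05, which is why the two criteria appear together on the slide.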

LINEAR REGRESSION: CAUTIONS

• Model misspecification
  - May be due to left-out variables
  - May be due to irrelevant variables
  - Functional misspecification: What if the underlying relationship between x and Y is not linear? [The Ramsey Regression Specification Error Test (RESET)]
• Extrapolation
  - Extending the model beyond the domain of available data
• Variable selection
  - Exclude irrelevant variables to avoid overfitting (they tend to have confidence intervals containing 0)
  - Exclude highly correlated variables (may also result in confidence intervals containing 0)
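The extrapolation caution can be enforced in code by flagging any prediction whose input lies outside the advertising range seen in the data ($0.4M to $1.9M in the case example). A minimal sketch, assuming the rounded single-feature coefficients; `predict_with_range_check` is a hypothetical helper name.

```python
# Advertising spend observed in the 15 regions ($ million).
advertising = [1.3, 0.7, 1.4, 0.5, 0.5, 1.9, 1.2, 0.4, 0.6, 1.3, 1.6, 1.0, 0.8, 0.6, 1.1]


def predict_with_range_check(x_new, observed_x=advertising, b0=42.2, b1=59.7):
    """Return (prediction, is_extrapolation) for a new advertising spend."""
    lowest, highest = min(observed_x), max(observed_x)
    # Outside the observed domain the linear fit has no supporting data.
    is_extrapolation = not (lowest <= x_new <= highest)
    return b0 + b1 * x_new, is_extrapolation


for spend in (1.2, 3.0):
    prediction, flagged = predict_with_range_check(spend)
    note = " (extrapolation: treat with caution)" if flagged else ""
    print(f"${spend}M advertising -> ${prediction:.1f}M sales{note}")
```

The planned $1.2M spend sits inside the observed range, whereas a $3.0M spend would rely on the line holding well beyond any region in the data.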
