0% found this document useful (0 votes)

2 views

STA200 - Lab Session - Chapter 14

Chapter 14 covers Simple Linear Regression, highlighting its appropriateness for predicting one variable based on another with a linear relationship using one independent variable. It outlines key assumptions for reliable regression results, including linearity, independence, homoscedasticity, and normality of errors. The chapter also includes case studies demonstrating regression analysis with practical examples and Excel steps for conducting regression and checking assumptions.

Uploaded by

xzelex22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

STA200 - Lab Session - Chapter 14

Uploaded by

xzelex22

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 29

Chapter 14:

Simple Linear Regression

Lab Session
www.aum.edu.kw
When is Simple Linear Regression
Appropriate?
Simple linear regression is useful when:
● You want to predict one variable based on another (e.g.,
profit from investment).
● You believe there is a linear relationship between the two
variables.
● The predictor variable is continuous (e.g., hours of employee
training, advertising spend).
● Only one independent variable is used.

www.aum.edu.kw
Assumptions of the Regression Model
For the regression results to be reliable, four key assumptions must be satisfied:
● Linearity: The relationship between the dependent and independent variable
must be linear.
● Independence: The residuals (errors) are independent of each other.
● Homoscedasticity: The variance of residuals is constant across all levels of the
independent variable.
● Normality of Errors: The residuals should be approximately normally distributed.
Notes:
● ෡𝑖 (Observed value – predicted value)
Residuals (errors) = 𝑌𝑖 − 𝑌
● Violation of these assumptions can lead to biased or misleading results.
www.aum.edu.kw
Possible Regression Lines in Simple Linear Regression

www.aum.edu.kw 4
Key Points
Regression Equation: Y = β1 X + β0
● 𝐑² is the Coefficient of Determination which represents the % of the variation in Y
that is explained by X.
● 𝐑 is the correlation coefficient which measures the strength and direction of the
linear relationship between the independent variable X and the dependent variable
Y
● 𝛃𝟏 : The coefficient of X
● 𝛃𝟎 : The coefficient of the intercepts
● 𝐘𝐢 is the observed value of Y
● 𝐘෡𝐢 is the predicted value
● 𝐗 𝐢 is the observed value of X
● p-value < 𝟎. 𝟎𝟓 → statistically significant → there is evidence of a linear
relationship. www.aum.edu.kw
www.aum.edu.kw 6
Excel Steps for Regression Assumptions and Output

1) For Regression Output :

Run the Regression Using Data Analysis ToolPak
1. Go to Data → Data Analysis (If not available, enable it via Excel Options →
Add-ins → ToolPak).
2. Select Regression and click OK.
3. Fill in the fields:
■ Input Y Range: Select your Y Data

■ Input X Range: Select your X Data

■ Check Labels (if you included headers)

■ Select Output Range (or New Worksheet)

www.aum.edu.kw
2) For Regression Assumptions:
a) Linearity:
1. Create a scatterplot of the observed Y values against the X values.
2. Add a trendline: Right-click a data point → Add Trendline → Choose "Linear“.

b) Independence:
1. After running regression, calculate residuals:
○ Add a column for Predicted Y using the regression equation.
○ Subtract Predicted Y from Actual Y.
○ Residual = Actual Y – Predicted Y
2. Create a scatterplot of residuals vs. observation order:
○ X-axis = Observation number (Add a new column called "Observation Order" (1 to n)).
○ Y-axis = Residual www.aum.edu.kw 8
2) For Regression Assumptions:
c) Homoscedasticity:
1. After running regression, calculate residuals:
○ Add a column for Predicted Y using the regression equation.
○ Subtract Predicted Y from Actual Y.
○ Residual = Actual Y – Predicted Y
2. Create a scatterplot of residuals vs. predicted Y values:
○ X-axis = Predicted Y
○ Y-axis = Residual

www.aum.edu.kw 9
2) For Regression Assumptions:
d) Normality of Residuals:
Create a histogram of residuals:
1. After running regression, calculate residuals:
○ Add a column for Predicted Y using the regression equation.
○ Subtract Predicted Y from Actual Y.
○ Residual = Actual Y – Predicted Y
2. Click anywhere in the residual column (or select the full range, e.g., C2:C36).
3. Go to the top menu and click:
○ Insert tab → In the Charts group → click on the Histogram icon (it's under the bar
chart dropdown).
4. Select “Histogram” from the options.
www.aum.edu.kw 10
Case Study 1:
Discount % vs. Weekly Profit
A retail business is analyzing how increasing discount percentages
affect its weekly profit.
Data for 35 observations:
Use the following data set to:
1) Comment on the regression
output
2) Write the regression
equation
3) Check the simple linear
regression assumptions
4) Predict the weekly profit if
the Discount applied is 25%.
www.aum.edu.kw
1) Regression output
Excel Output:

www.aum.edu.kw 12
1) Regression output Output Comments
• 𝐑2 = 𝟎. 𝟗𝟐𝟕𝟎 = 𝟗𝟐. 𝟕% which means that
Excel Output: 92.7% of the variation in the weekly profit is
explained by the discount %.
• 𝜷𝟏 = −𝟒𝟐 negative slope, which reflect the
negative linear relationship. In addition, 1 hour
increase in the discount % leads to $42
decrease in the weekly profit.
• 𝜷𝟎 = 𝟐𝟎𝟐𝟓
• p-value < 0.05 which means that the discount %
has significant effect on the weekly profit.
• 𝐑 = − 𝑅2 𝛽1 < 0 = − 0.9270 = +0.9628
strong negative linear relationship

www.aum.edu.kw 13
2) Regression Equation
Using the Regression output

Regression Equation:
෣Profit = −42 × (𝐷𝑖𝑠𝑐𝑜𝑢𝑛𝑡 %) + 2025
𝑊𝑒𝑒𝑘𝑙𝑦

www.aum.edu.kw 14
3) Checking the Regression Assumptions
Excel Output:

www.aum.edu.kw 15
4) Profit Prediction:
Predict the weekly profit if the Discount applied is 25%?

Regression Equation:
෣
Weely Profit = −42 × (𝐷𝑖𝑠𝑐𝑜𝑢𝑛𝑡 %) + 2025

If the discount % is 25, the profit will be:

෣
Weely Profit = −42 × (25) + 2025 = $975

www.aum.edu.kw 16
Case Study 2:
Ads per Week vs. Units Sold
A marketing team wants to examine whether running more online
ads leads to more product sales.
Data for 35 observations:
Use the following data set to:
1) Comment on the regression
output
2) Write the regression
equation
3) Check the simple linear
regression assumptions
4) Predict the Units Sold if the
number of Ads per week is
60.
www.aum.edu.kw
1) Regression output
Excel Output:

www.aum.edu.kw 18
1) Regression output Output Comments
• 𝐑2 = 𝟎. 𝟗𝟐𝟕𝟎 = 𝟗𝟐. 𝟕% which means that
Excel Output: 92.7% of the variation in the weekly units sold
is explained by the number of Ads per week.
• 𝜷𝟏 = 𝟐𝟐 positive slope, which reflect the
positive linear relationship. In addition, 1 Ads
increase per week leads to 22 units increase in
the weekly units sold.
• 𝜷𝟎 = 𝟑𝟎𝟎
• p-value < 0.05 which means that the number of
Ads per week has significant effect on the weekly
number of units sold.
• 𝐑 = + R² (𝛽1 > 0) = + 0.9270 = +0.9628
strong positive linear relationship

www.aum.edu.kw 19
2) Regression Equation
Using the Regression output

Regression Equation:
෣
Units Sold = 22 × (𝐴𝑑𝑠 𝑝𝑒𝑟 𝑤𝑒𝑒𝑘) + 300

www.aum.edu.kw 20
3) Checking the Regression Assumptions
Excel Output:

www.aum.edu.kw 21
4) Units Sold Prediction:
Predict the Units Sold if the number of Ads per week is 60?

Regression Equation:
෣
Units Sold = 22 × (𝐴𝑑𝑠 𝑝𝑒𝑟 𝑤𝑒𝑒𝑘) + 300

If the discount % is 25, the profit will be:

෣
Units Sold = 22 × 60 + 300 = 1620 𝑢𝑛𝑖𝑡𝑠

www.aum.edu.kw 22
Case Study 3:
Training Hours vs. Productivity Score
An HR department is assessing the impact of employee training
hours on productivity scores.
Data for 35 observations:
Use the following data set to:
1) Comment on the regression
output
2) Write the regression
equation
3) Check the simple linear
regression assumptions
4) Predict the productivity
score for 40 training hours.
www.aum.edu.kw
2) Regression output
Excel Output:

www.aum.edu.kw 24
2) Regression output Output Comments
• 𝐑2 = 𝟎. 𝟖𝟎𝟐𝟗 = 𝟖𝟎. 𝟐𝟗% which means that
Excel Output: 80.29% of the variation in the production
score is explained by the training hours.
• 𝜷𝟏 = 𝟓 positive slope, which reflect the
positive linear relationship. In addition, an
increase of 1 training hour leads the
productivity score to increase by 5 points.
• 𝜷𝟎 = 𝟔𝟔
• p-value < 0.05 which means that the training
hours has significant effect on the productivity
score.
• 𝐑 = + R² (𝛽1 > 0) = + 0.8029 = +0.8960
strong positive linear relationship

www.aum.edu.kw 25
3) Regression Equation
Using the Regression output

Regression Equation:
෣ Score = 5 × (𝑇𝑟𝑎𝑖𝑛𝑖𝑛𝑔 𝐻𝑜𝑢𝑟𝑠) + 66
Productivity

www.aum.edu.kw 26
3) Checking the Regression Assumptions
Excel Output:

www.aum.edu.kw 27
4) Productivity Score Prediction:
Predict the productivity score for 40 training hours?

Regression Equation:
෣ Score = 5 × (𝑇𝑟𝑎𝑖𝑛𝑖𝑛𝑔 𝐻𝑜𝑢𝑟𝑠) + 66
Productivity

If the discount % is 40, the profit will be:

෣ Score = 5 × (40) + 66 = 266
Productivity

www.aum.edu.kw 28
End of Session

www.aum.edu.kw 29

2023 Exam Memo
No ratings yet
2023 Exam Memo
10 pages
Fault Diagnosis - MAN CATs II PDF
0% (1)
Fault Diagnosis - MAN CATs II PDF
17 pages
Simple Liner REgression
No ratings yet
Simple Liner REgression
27 pages
STATG5 - Simple Linear Regression Using SPSS Module
No ratings yet
STATG5 - Simple Linear Regression Using SPSS Module
16 pages
Supply Chain Analytics
No ratings yet
Supply Chain Analytics
8 pages
Practical - Regression
No ratings yet
Practical - Regression
114 pages
Unit 8 Regression Analysis
No ratings yet
Unit 8 Regression Analysis
22 pages
LGT2425 Lecture 3 Part II (Notes)
No ratings yet
LGT2425 Lecture 3 Part II (Notes)
55 pages
Regression Analysis
No ratings yet
Regression Analysis
20 pages
Intro to reg models
No ratings yet
Intro to reg models
27 pages
Da On Regression
No ratings yet
Da On Regression
58 pages
Linear Regression Analysis in Excel 2
No ratings yet
Linear Regression Analysis in Excel 2
15 pages
Regression Practice Questions
No ratings yet
Regression Practice Questions
19 pages
Engineering - Simple Correlation and Regression - 2024
No ratings yet
Engineering - Simple Correlation and Regression - 2024
35 pages
Ba All Notes Merge - Merged
No ratings yet
Ba All Notes Merge - Merged
385 pages
Sasin DECS 434 Session 4 - Rate of Change and Benchmarking
No ratings yet
Sasin DECS 434 Session 4 - Rate of Change and Benchmarking
52 pages
What Is Linear Regression
No ratings yet
What Is Linear Regression
14 pages
Chapter 18
No ratings yet
Chapter 18
25 pages
2023 Statistics Fin 10
No ratings yet
2023 Statistics Fin 10
14 pages
Model Development
No ratings yet
Model Development
80 pages
W6 - L6 - Multiple Linear Regression
No ratings yet
W6 - L6 - Multiple Linear Regression
3 pages
Simple Linear Regression: Coefficient of Determination
No ratings yet
Simple Linear Regression: Coefficient of Determination
21 pages
Econometrics
No ratings yet
Econometrics
18 pages
P4-FDA-B29-Monish Patle
No ratings yet
P4-FDA-B29-Monish Patle
14 pages
Statstic Slide
No ratings yet
Statstic Slide
24 pages
Correlation Regression - 2023
No ratings yet
Correlation Regression - 2023
69 pages
Multiple Regression
No ratings yet
Multiple Regression
25 pages
Presentation Business Applications
No ratings yet
Presentation Business Applications
18 pages
Predictive Analytics - Business Predictions Using Mutliple Linear Regression
No ratings yet
Predictive Analytics - Business Predictions Using Mutliple Linear Regression
21 pages
Linear Regression
100% (2)
Linear Regression
28 pages
Linear Regression. Com
No ratings yet
Linear Regression. Com
13 pages
Linear Regression Analysis in Excel Assingment
No ratings yet
Linear Regression Analysis in Excel Assingment
17 pages
Linear Regression Analysis in Excel
No ratings yet
Linear Regression Analysis in Excel
15 pages
2.3 Assumptions of Linear Regression
No ratings yet
2.3 Assumptions of Linear Regression
16 pages
Unit 5 and 6 - Inferential Statistics and Regression Analysis
No ratings yet
Unit 5 and 6 - Inferential Statistics and Regression Analysis
68 pages
Predective Analytics or Inferential Statistics
No ratings yet
Predective Analytics or Inferential Statistics
27 pages
Regrion
No ratings yet
Regrion
19 pages
A Tutorial On How To Run A Simple Linear Regression in Excel
No ratings yet
A Tutorial On How To Run A Simple Linear Regression in Excel
19 pages
Multiple Linear Regression Analysis Usin
No ratings yet
Multiple Linear Regression Analysis Usin
19 pages
Multiple Linear Regression in Excel
No ratings yet
Multiple Linear Regression in Excel
19 pages
Regression Analysis in Excel
No ratings yet
Regression Analysis in Excel
20 pages
CH 06
No ratings yet
CH 06
20 pages
Regression Using Excel
No ratings yet
Regression Using Excel
18 pages
Exploratory Data Analytics-1
No ratings yet
Exploratory Data Analytics-1
27 pages
Linear Regression Analysis in Excel
No ratings yet
Linear Regression Analysis in Excel
17 pages
BusStat W03 Simple Regression 1
No ratings yet
BusStat W03 Simple Regression 1
15 pages
Linear Regression Analysis in Excel
No ratings yet
Linear Regression Analysis in Excel
17 pages
Lecture 6 - Multiple Regression Analysis
No ratings yet
Lecture 6 - Multiple Regression Analysis
32 pages
Interpreting Correlation
No ratings yet
Interpreting Correlation
13 pages
Predicting Pregnancies of Our Customers I - Regression Model
No ratings yet
Predicting Pregnancies of Our Customers I - Regression Model
50 pages
DS Unit-Iv
No ratings yet
DS Unit-Iv
34 pages
Module 3: Demand Forecasting: Unit 5: Linear Regression Forecasting
No ratings yet
Module 3: Demand Forecasting: Unit 5: Linear Regression Forecasting
9 pages
UE20CS312 Unit2 Slides
No ratings yet
UE20CS312 Unit2 Slides
206 pages
How To Do Linear Regression With Excel
No ratings yet
How To Do Linear Regression With Excel
8 pages
Assignment On Regression
100% (1)
Assignment On Regression
11 pages
Lecture Notes
No ratings yet
Lecture Notes
141 pages
W6 - L5 - Assumptions of Regression
No ratings yet
W6 - L5 - Assumptions of Regression
4 pages
Chapter 14 An Introduction To Multiple Linear Regression: No. of Emails % Discount Sales Customer X X y
No ratings yet
Chapter 14 An Introduction To Multiple Linear Regression: No. of Emails % Discount Sales Customer X X y
9 pages
Regression Analysis Using Excel
100% (1)
Regression Analysis Using Excel
85 pages
BA unit 2 notes (1)
No ratings yet
BA unit 2 notes (1)
5 pages
Linear and Nonlinear Programming Essentials
From Everand
Linear and Nonlinear Programming Essentials
Tanushri Kaniyar
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
Nuvole Bianchi - Ludovico Einaudi - Sheet Music For Piano (Solo) Musescore - Com 2
No ratings yet
Nuvole Bianchi - Ludovico Einaudi - Sheet Music For Piano (Solo) Musescore - Com 2
1 page
Communities - Virtual vs. Real - Science
No ratings yet
Communities - Virtual vs. Real - Science
5 pages
Violet Clean Out Unit: Electronics
No ratings yet
Violet Clean Out Unit: Electronics
2 pages
InBatch 21CFR Part 11 Deployment Guide 2016
No ratings yet
InBatch 21CFR Part 11 Deployment Guide 2016
56 pages
PROF ED 108: Technology For Teaching and Learning
No ratings yet
PROF ED 108: Technology For Teaching and Learning
43 pages
my spectacular and boring no phone sundays - Vera Hester
No ratings yet
my spectacular and boring no phone sundays - Vera Hester
3 pages
SO3 A1 Wordlist
No ratings yet
SO3 A1 Wordlist
124 pages
Depth-Peeling For Texture-Based Volume Rendering
No ratings yet
Depth-Peeling For Texture-Based Volume Rendering
5 pages
Ophelia SlidesCarnival
No ratings yet
Ophelia SlidesCarnival
29 pages
B. Tech ECE Electronics Computer Engineering 2018 Pattern
No ratings yet
B. Tech ECE Electronics Computer Engineering 2018 Pattern
7 pages
ECE Course List
No ratings yet
ECE Course List
12 pages
Dotnet Developer Questions
No ratings yet
Dotnet Developer Questions
2 pages
Modelling
No ratings yet
Modelling
1,161 pages
Dissertation Examples International Business
100% (1)
Dissertation Examples International Business
4 pages
Hive Partitions
No ratings yet
Hive Partitions
5 pages
Unit 2 B Lesson 2, Pg229-230 Questions
No ratings yet
Unit 2 B Lesson 2, Pg229-230 Questions
2 pages
MULTISER-xx-PC-TFT Manual
No ratings yet
MULTISER-xx-PC-TFT Manual
34 pages
Sneak Peak BCTCI - Sliding Windows & Binary Search
No ratings yet
Sneak Peak BCTCI - Sliding Windows & Binary Search
60 pages
Case Study Silvus Land and Sea Demo v2.0
No ratings yet
Case Study Silvus Land and Sea Demo v2.0
5 pages
CPM PT 1
No ratings yet
CPM PT 1
199 pages
The Rise of Quantum Computing
No ratings yet
The Rise of Quantum Computing
2 pages
Computers & Me: Matching: Write The Names Under The Correct Pictures
No ratings yet
Computers & Me: Matching: Write The Names Under The Correct Pictures
9 pages
Check Zone in 7ss52-53 - Iec
No ratings yet
Check Zone in 7ss52-53 - Iec
2 pages
Personalizing Dialogue Agents: I Have A Dog, Do You Have Pets Too?
No ratings yet
Personalizing Dialogue Agents: I Have A Dog, Do You Have Pets Too?
16 pages
Catalog: Printer Consumables
100% (1)
Catalog: Printer Consumables
18 pages
Mock Exam 3
No ratings yet
Mock Exam 3
27 pages
Information Assurance Security in the Information Environment 2nd edition by Andrew Blyth, Gerald Kovacich ISBN 1846282667 978-1846282669 download
100% (1)
Information Assurance Security in the Information Environment 2nd edition by Andrew Blyth, Gerald Kovacich ISBN 1846282667 978-1846282669 download
47 pages
Emerson, Society and Solitude
No ratings yet
Emerson, Society and Solitude
317 pages

STA200 - Lab Session - Chapter 14

Uploaded by

STA200 - Lab Session - Chapter 14

Uploaded by

Chapter 14:

Simple Linear Regression

1) For Regression Output :

■ Input X Range: Select your X Data

■ Check Labels (if you included headers)

■ Select Output Range (or New Worksheet)

If the discount % is 25, the profit will be:

If the discount % is 25, the profit will be:

If the discount % is 40, the profit will be:

You might also like