Chapter 7 Presentation - 11.18.2024

This document covers regression analysis, a statistical method for examining relationships between variables, focusing on linear regression and its applications. It explains the method of least squares for estimating regression coefficients, calculating slope and intercept, and measuring variability in results. Additionally, it contrasts regression analysis with correlation analysis, highlighting their respective purposes and methods of quantification.


Statistics and Probability

CIVE 224
Chapter 7
Regression Analysis
Regression Analysis

- A statistical method for examining the relationship between two or more variables.
- The purpose is to understand how ONE dependent variable (Y) changes when any one of the independent variables (X) changes, while the other independent variables are held fixed.
- Regression helps predict the dependent variable with the help of the independent variable(s).

Types of Regression:
- Linear (single or multiple independent variables)
- Nonlinear: exponential, power, logarithmic, etc.

Applications:
- Predicting Y based on X
- Determining the strength of the predictors Xi, i.e., which X is the most reliable for predicting Y
- Trend analysis: trends over time

Linear Regression

The Method of Least Squares:
- A standard approach in regression analysis for approximating the solution when there are more equations than unknowns (intercept & slope). Such a system is called over-determined.
- Used to find the best-fit line of a data set by minimizing the sum of the squares of the differences ("residuals") between observed and predicted values.
- Residual: the difference between an observed and a predicted value.
- Squaring the residuals ensures that both positive and negative differences add to the overall error and that larger errors are penalized more heavily.

The goal of least-squares regression is to find the values of a and b that minimize

\sum (y_{observed} - y_{predicted})^2

Mathematically, for a data set (X_i, Y_i), i = 1, ..., n, the linear regression is described by:

Y(X_i) = a + b X_i

Linear Regression

Y(X_i) = a + b X_i

For least-squares regression, the total error (E) is

E = \sum_{i=1}^{n} [y_i - (a + b x_i)]^2 = \sum_{i=1}^{n} [y_i - Y(x_i)]^2

- y_i: observed value of the dependent variable
- Y(x_i): predicted value of Y for X_i, calculated using the regression equation
- [y_i - Y(x_i)]^2: squared residual for each data point

The goal: find the intercept (a) and slope (b) that minimize E to ensure the best fit to the data:

\min_{a,b} \sum_{i=1}^{n} [y_i - (a + b x_i)]^2

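To make the objective concrete, here is a minimal Python sketch that evaluates the total error E for two candidate (a, b) pairs; it uses the small data set from the worked example later in these slides, and the function name total_error is my own choice rather than course notation.

```python
# Minimal sketch: evaluate the least-squares objective E(a, b)
# for a candidate intercept a and slope b.
x = [1, 2, 3, 4, 5]   # data set from the worked example in these slides
y = [2, 4, 5, 4, 5]

def total_error(a, b, x, y):
    """Sum of squared residuals: E = sum of (y_i - (a + b*x_i))^2."""
    return sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))

print(total_error(2.2, 0.6, x, y))  # E for the least-squares line (about 2.4)
print(total_error(0.0, 1.0, x, y))  # a worse candidate line gives a larger E (9.0)
```
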
Linear Regression

\min_{a,b} \sum_{i=1}^{n} [y_i - (a + b x_i)]^2

We calculate the partial derivatives of E with respect to a and b and set them to zero (this locates the minimum of E). In other words, the estimates are derived by setting the partial derivative of the sum of squared residuals with respect to each parameter (intercept and slope) equal to zero:

\frac{\partial E}{\partial a} = 0 \quad \text{and} \quad \frac{\partial E}{\partial b} = 0

With respect to a (the derivative of (-a) is (-1)):

\frac{\partial E}{\partial a} = -2 \sum_{i=1}^{n} [y_i - (a + b x_i)] = 0

Equivalently:

\sum_{i=1}^{n} [y_i - (a + b x_i)] = 0          (Equation 1)

Linear Regression

Partial derivative with respect to b: "a" is treated as a constant with respect to b, so the derivative of [y_i - (a + b x_i)] with respect to b is -x_i. Hence

\frac{\partial E}{\partial b} = \sum_{i=1}^{n} 2(-x_i)[y_i - (a + b x_i)] = -2 \sum_{i=1}^{n} [y_i - (a + b x_i)] x_i = 0

Equivalently:

\sum_{i=1}^{n} [y_i - (a + b x_i)] x_i = 0          (Equation 2)

Solving Equations 1 and 2 gives the optimal intercept and slope values that minimize the sum of squared residuals, thereby giving us the line of best fit.

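Expanding the sums, Equation 1 becomes n a + b \sum x_i = \sum y_i and Equation 2 becomes a \sum x_i + b \sum x_i^2 = \sum x_i y_i, a 2×2 linear system in a and b. As a minimal sketch, the NumPy snippet below solves that system for the worked-example data used later in these slides:

```python
# Sketch: solve Equations 1 and 2 directly as a 2x2 linear system.
import numpy as np

x = np.array([1, 2, 3, 4, 5], dtype=float)   # worked-example data from these slides
y = np.array([2, 4, 5, 4, 5], dtype=float)
n = len(x)

# Equation 1:  n*a         + (sum x_i)*b   = sum y_i
# Equation 2:  (sum x_i)*a + (sum x_i^2)*b = sum x_i*y_i
A = np.array([[n,       x.sum()],
              [x.sum(), (x ** 2).sum()]])
rhs = np.array([y.sum(), (x * y).sum()])

a, b = np.linalg.solve(A, rhs)
print(a, b)   # expected: a ≈ 2.2, b ≈ 0.6 (matches the worked example)
```
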
Estimating Linear Regression Coefficients

This involves finding the values of a and b in the regression model

Y(X) = a + b X

that minimize the sum of squared residuals (the differences between observed and predicted values).

1. Calculate the means of X and Y:

\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i, \qquad \bar{y} = \frac{1}{n} \sum_{i=1}^{n} y_i

2. Calculate the slope:

b = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}

This formula is derived from the least-squares criterion, ensuring that the line minimizes the sum of squared residuals.

3. Calculate the intercept:

a = \bar{y} - b \bar{x} = \frac{\sum y_i - b \sum x_i}{n}

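The three steps above translate directly into code. Below is a minimal Python sketch; the function and variable names are my own choices, not notation from the slides.

```python
# Sketch of steps 1-3: means, slope from deviations, then intercept.
def fit_line(x, y):
    n = len(x)
    x_bar = sum(x) / n                                   # step 1: mean of X
    y_bar = sum(y) / n                                   #         mean of Y
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    b = sxy / sxx                                        # step 2: slope
    a = y_bar - b * x_bar                                # step 3: intercept
    return a, b

# Worked-example data from the next slides; expected result: a ≈ 2.2, b = 0.6
print(fit_line([1, 2, 3, 4, 5], [2, 4, 5, 4, 5]))
```
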
Example

For the following data set, calculate the slope and intercept for the best-fit line.

X    Y
1    2
2    4
3    5
4    4
5    5

b = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}

1. Calculate the X average: 3
2. Calculate the X deviations: -2, -1, 0, 1, 2
3. Calculate the Y average: 4
4. Calculate the Y deviations: -2, 0, 1, 0, 1
5. Calculate \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y}) = 6
6. Calculate \sum_{i=1}^{n} (x_i - \bar{x})^2 = 10
7. b = 6 / 10 = 0.6

Example

For the same data set, calculate the intercept:

a = \frac{\sum y_i - b \sum x_i}{n}

1. Calculate \sum x_i = 15
2. Calculate b \sum x_i = 0.6 \times 15 = 9
3. Calculate \sum y_i = 20
4. a = (20 - 9) / 5 = 2.2

The best-fit line is therefore:

Y(X) = a + b X  =>  Y(X) = 2.2 + 0.6 X

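As a quick cross-check on the hand calculation, NumPy's built-in least-squares polynomial fit should reproduce the same coefficients:

```python
# Cross-check of the worked example with NumPy's degree-1 polynomial fit.
import numpy as np

x = np.array([1, 2, 3, 4, 5], dtype=float)
y = np.array([2, 4, 5, 4, 5], dtype=float)

b, a = np.polyfit(x, y, deg=1)        # returns [slope, intercept] for deg=1
print(f"Y(X) = {a:.2f} + {b:.2f} X")  # expected: Y(X) = 2.20 + 0.60 X
```
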
Measuring the Variability of Results

Total Variability:
- S_y measures the overall spread of the Y values around the mean \bar{y}.
- It provides a baseline for judging how much of the variability in Y can be attributed to its relationship with X versus how much is just random variation around the mean.
- S_y quantifies how much variation exists in the dependent variable Y, which is useful for assessing the fit of a regression model.

S_y = \sqrt{\frac{S_{yy}}{n - 1}}, \qquad S_{yy} = \sum (y_i - \bar{y})^2

where S_{yy} is the total sum of squares for the dependent variable.

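A minimal Python sketch of this calculation, applied to the Y values from the earlier worked example (the numbers in the comments are computed here for illustration, not quoted from the slides):

```python
# Sketch: total variability S_y of the observed Y values around their mean.
import math

y = [2, 4, 5, 4, 5]                       # Y values from the earlier worked example
n = len(y)
y_bar = sum(y) / n
syy = sum((yi - y_bar) ** 2 for yi in y)  # total sum of squares S_yy
s_y = math.sqrt(syy / (n - 1))            # S_y = sqrt(S_yy / (n - 1))
print(syy, s_y)                           # S_yy = 6.0, S_y ≈ 1.22
```
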
Measuring the Variability of Results

Variability About the Regression Line (S_{Y|X}) – Standard Error of Estimate:
- S_{Y|X} measures how well the regression line fits the data.
- When S_{Y|X} is small, the points are close to the regression line.
- S_{Y|X} describes the variability of the observed Y values around the value predicted by the regression line for each X value, \hat{y}(x) = a + b x.

How to calculate S_{Y|X}, the variability about the regression line:

S_{Y|X} = \sqrt{\frac{S_{yy} - b S_{xy}}{n - 2}}

The formula is derived from the residuals of the regression and measures the spread of the data points around the fitted regression line.

Measuring the Variability of Results

Calculate S_{Y|X} using the formula:

S_{Y|X} = \sqrt{\frac{S_{yy} - b S_{xy}}{n - 2}}

- S_{yy}: the total sum of squares for the dependent variable Y (total variation):
  S_{yy} = \sum_{i=1}^{n} (y_i - \bar{y})^2
- S_{xy}: the sum of the products of the deviations of X and Y from their means:
  S_{xy} = \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})
- b: the slope of the regression line:
  b = \frac{S_{xy}}{S_{xx}} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
- n: the number of observations

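A minimal Python sketch computing S_{Y|X} from S_yy, S_xy, and b, again using the earlier worked-example data (the resulting value is computed here for illustration, not quoted from the slides):

```python
# Sketch: standard error of estimate S_{Y|X} from S_yy, S_xy and the slope b.
import math

x = [1, 2, 3, 4, 5]                      # worked-example data from these slides
y = [2, 4, 5, 4, 5]
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

sxx = sum((xi - x_bar) ** 2 for xi in x)
sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
syy = sum((yi - y_bar) ** 2 for yi in y)

b = sxy / sxx                            # slope of the regression line
s_y_given_x = math.sqrt((syy - b * sxy) / (n - 2))
print(s_y_given_x)                       # ≈ 0.894 for this data set
```
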
Correlation vs. Regression Analysis

Regression Analysis:
Concerned with predicting the LEVEL of the dependent variable Y for a given independent variable X.

Correlation Analysis:
Concerned with the STRENGTH of the relationship between Y and X.

Correlation measures the:
▪ Strength
▪ Direction
of the linear relationship between two variables.

It is often quantified by the Pearson correlation coefficient, which ranges from -1 (perfect negative correlation) to +1 (perfect positive correlation), with 0 indicating no linear relationship.

Correlation vs. Regression
Correlation

Sample Correlation Coefficient:

r = \frac{S_{xy}}{\sqrt{S_{xx} S_{yy}}} = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \; \sum_{i=1}^{n} (y_i - \bar{y})^2}}

where

S_{xx} = \sum_{i=1}^{n} (x_i - \bar{x})^2, \qquad S_{xy} = \sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y}), \qquad S_{yy} = \sum_{i=1}^{n} (y_i - \bar{y})^2

To obtain the population correlation coefficient \rho, we use the Fisher Z transformation.

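A minimal Python sketch of the sample correlation coefficient, using the same small data set as the earlier worked example (the resulting r value is computed here for illustration):

```python
# Sketch: sample correlation coefficient r = S_xy / sqrt(S_xx * S_yy).
import math

def pearson_r(x, y):
    n = len(x)
    x_bar, y_bar = sum(x) / n, sum(y) / n
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    return sxy / math.sqrt(sxx * syy)

# Worked-example data from these slides: r = 6 / sqrt(10 * 6) ≈ 0.775
print(pearson_r([1, 2, 3, 4, 5], [2, 4, 5, 4, 5]))
```
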
Example

Data for pressure and flow rate:

Pressure (x): 5, 6, 7, 8, 9, 10
Flow rate (y): 14, 25, 70, 85, 49, 105

Calculate the line of best fit and the correlation coefficient (r), and explain what it means.

- Calculate the slope (b):

b = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n} (x_i - \bar{x})^2} = 15.49

- Calculate the intercept (a):

a = \frac{\sum y_i - b \sum x_i}{n} = -58.14

- Line of best fit:

Y = -58.14 + 15.49 X

- Calculate the correlation coefficient (r):

r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}{\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2 \; \sum_{i=1}^{n} (y_i - \bar{y})^2}} = \frac{271.0}{329.1} = 0.824

Example

What r represents:
• The correlation coefficient (r) measures the strength and direction of the linear relationship between pressure and flow rate.
• r = 0.824 indicates a strong positive correlation, meaning that as the pressure increases, the flow rate tends to increase as well.
• The value is close to 1, suggesting that the data points are relatively well aligned with the line of best fit.
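A short Python cross-check of this example; it should reproduce the slope, intercept, and correlation coefficient reported above.

```python
# Cross-check of the pressure / flow-rate example.
import math

x = [5, 6, 7, 8, 9, 10]            # pressure
y = [14, 25, 70, 85, 49, 105]      # flow rate
n = len(x)
x_bar, y_bar = sum(x) / n, sum(y) / n

sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
sxx = sum((xi - x_bar) ** 2 for xi in x)
syy = sum((yi - y_bar) ** 2 for yi in y)

b = sxy / sxx                      # ≈ 15.49
a = y_bar - b * x_bar              # ≈ -58.14
r = sxy / math.sqrt(sxx * syy)     # ≈ 0.824
print(f"Y = {a:.2f} + {b:.2f} X,  r = {r:.3f}")
```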
