2018, Study Session # 3, Reading # 10

“MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS”


MSR = Mean Regression Sum of Squares
MSE = Mean Squared Error
RSS = Regression Sum of Squares
SSE = Sum of Squared Errors/Residuals
α = Level of Significance
Fc = Critical F taken from F-Distribution Table
H0 = Null Hypothesis
Ha = Alternative Hypothesis
X = Independent Variable
Y = Dependent Variable
F = F-Statistic (calculated)

1. INTRODUCTION

• Multiple linear regression models are more sophisticated than simple linear regression.
• They incorporate more than one independent variable.

2. MULTIPLE LINEAR REGRESSION

• Allows determining the effects of more than one independent variable on a particular dependent variable (see the sketch after this list).
• Yi = b0 + b1X1i + b2X2i + … + bkXki + εi
• b1 tells the impact on Y of changing X1 by 1 unit, keeping the other independent variables the same.
• Individual slope coefficients (e.g. b1) in multiple regression are known as partial regression/slope coefficients.
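To make the mechanics concrete, here is a minimal sketch (not from the FinQuiz notes; the toy data and coefficient values are assumptions) of estimating a two-variable multiple regression by ordinary least squares with numpy:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100
X1 = rng.normal(size=n)
X2 = rng.normal(size=n)
e = rng.normal(scale=0.5, size=n)
Y = 1.0 + 2.0 * X1 - 0.5 * X2 + e          # true b0 = 1.0, b1 = 2.0, b2 = -0.5

X = np.column_stack([np.ones(n), X1, X2])  # design matrix with intercept column
b_hat, _, _, _ = np.linalg.lstsq(X, Y, rcond=None)
print(b_hat)  # estimated [b0, b1, b2], close to the true values
```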

2.1 Assumption of the Multiple Linear Regression Model

• Relationship b/w Y and X1, X2, …, Xk is linear.
• Independent variables are not random, and no exact linear relationship exists b/w 2 or more independent variables.
• Expected value of the error term is 0.
• Variance of the error term is the same for all observations.
• Error term is uncorrelated across observations.
• Error term is normally distributed.

2.2 Predicting the Dependent Variable in a Multiple Regression Model

• Obtain estimates b̂0, b̂1, b̂2, …, b̂k of the regression parameters b0, b1, b2, …, bk.
• Determine assumed values of the independent variables X̂1, X̂2, …, X̂k.
• Compute the predicted value Ŷi using Ŷi = b̂0 + b̂1X̂1i + b̂2X̂2i + … + b̂kX̂ki (worked example below).
• To predict the dependent variable:
  • Be confident that the assumptions of the regression are met.
  • Predictions regarding X must be within the reliable range of the data used to estimate the model.
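A brief worked example (all numbers are assumed for illustration, not from the notes): with estimates b̂0 = 0.5, b̂1 = 1.2, b̂2 = −0.3 and assumed values X̂1 = 2, X̂2 = 4, the prediction is Ŷ = 0.5 + 1.2(2) − 0.3(4) = 0.5 + 2.4 − 1.2 = 1.7.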

2.3 Testing Whether All Population Regression Coefficients Equal Zero

• H0 ⇒ all slope coefficients are simultaneously = 0 (b1 = b2 = … = bk = 0), i.e. none of the X variables helps explain Y.
• To test H0, an F-test is used (sketch below).
• A t-test cannot be used (it tests only one coefficient at a time).
• F = MSR / MSE = (RSS / k) / (SSE / (n − (k + 1)))


• Where:
  • MSR = RSS / k
  • MSE = SSE / (n − (k + 1))
  • n = no. of observations
  • k = no. of slope coefficients
• Decision rule ⇒ reject H0 if F > Fc (for given α).
  • It is a one-tailed test.
  • df numerator = k
  • df denominator = n − (k + 1)
• For given k and n, the test statistic for H0 (all slope coefficients equal 0) is F(k, n − (k + 1)).
• In the F-distribution table, find F(k, n − (k + 1)), where k gives the column and n − (k + 1) gives the row.
• "Significance of F" in the ANOVA table represents the p-value.
• ↑ F-statistic ⇒ ↓ chance of Type I error.
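A minimal sketch of the joint F-test above (the sums of squares and sample sizes are assumed toy numbers; scipy is used only for the critical value):

```python
from scipy.stats import f

RSS, SSE = 120.0, 80.0   # regression & residual sums of squares (assumed values)
n, k = 50, 3             # no. of observations & slope coefficients (assumed)

MSR = RSS / k
MSE = SSE / (n - (k + 1))
F_stat = MSR / MSE

F_crit = f.ppf(0.95, dfn=k, dfd=n - (k + 1))  # one-tailed, alpha = 0.05
print(F_stat, F_crit, F_stat > F_crit)        # True => reject H0
```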

2.4 Adjusted R²

• R² ↑ with the addition of independent variables (X) to the regression.
• Adjusted R² = 1 − [(n − 1) / (n − k − 1)] × (1 − R²)  (worked example below)
• When k ≥ 1 ⇒ R² > adjusted R².
• Adjusted R² can be −ve, but R² is always +ve.
• If adjusted R² is used for comparing regression models:
  • sample size must be the same.
  • dependent variable must be defined in the same way.
• ↑ Adjusted R² does not necessarily indicate the regression is well specified.
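A brief worked example (numbers assumed for illustration): with R² = 0.60, n = 30, and k = 3, adjusted R² = 1 − (29 / 26) × (1 − 0.60) ≈ 1 − 0.446 = 0.554, slightly below R², as expected.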


3. USING DUMMY VARIABLES IN REGRESSION

• Dummy variable ⇒ takes the value 1 if a particular condition is true & 0 when it is false.
• Diligence is required in choosing the no. of dummy variables.
• Usually n − 1 dummy variables are used, where n = no. of categories; using all n would create an exact linear relation with the intercept (see the sketch below).
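A minimal sketch (the quarterly categories are an assumed example) of building n − 1 dummies with pandas:

```python
import pandas as pd

df = pd.DataFrame({"quarter": ["Q1", "Q2", "Q3", "Q4", "Q1", "Q2"]})
# n = 4 categories => keep n - 1 = 3 dummies; dropping Q1 avoids the
# exact linear relation with the intercept noted above.
dummies = pd.get_dummies(df["quarter"], drop_first=True)  # columns: Q2, Q3, Q4
print(dummies)
```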


4. VIOLATIONS OF REGRESSION ASSUMPTIONS

4.1 Heteroskedasticity

• Variance of errors differs across observations ⇒ heteroskedastic.
• Variance of errors is similar across observations ⇒ homoskedastic.
• Usually no systematic relationship exists b/w X & the regression residuals.
• If a systematic relationship is present ⇒ heteroskedasticity may exist.

4.1.1 The Consequences of Heteroskedasticity

• It can lead to mistakes in inference.
• Does not affect consistency of the estimators.
• F-test becomes unreliable.
• Due to biased estimators of the standard errors, the t-test also becomes unreliable.
• Most likely result of heteroskedasticity is that the:
  • estimated standard errors will be underestimated.
  • t-statistics will be inflated.
• Ignoring heteroskedasticity leads to finding significant relationships that do not actually exist.
• It becomes more serious when developing an investment strategy using regression analysis.
• Unconditional heteroskedasticity ⇒ heteroskedasticity of the error variance is not correlated with the independent variables in the multiple regression.
  • Creates no major problems for statistical inference.
• Conditional heteroskedasticity ⇒ heteroskedasticity of the error variance is correlated with the independent variables.
  • It causes the most problems.
  • Can be tested & corrected easily through many statistical software packages.

4.1.2 Testing for Heteroskedasticity

• Breusch-Pagan test is widely used (sketch below).
  • Regress the squared residuals from the estimated regression on the independent variables.
  • If the independent variables explain much of the variation in the squared residuals ⇒ conditional heteroskedasticity exists.
  • H0 = no conditional heteroskedasticity exists.
  • Ha = conditional heteroskedasticity exists.
• Breusch-Pagan test statistic = nR², a one-tailed χ² test,
  • where R² is from the regression of the squared residuals on X.
• Critical value ⇒ taken from the χ² distribution.
  • df = no. of independent variables.
• Reject H0 if test statistic > critical value.
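A minimal sketch of the Breusch-Pagan idea (toy data and the helper name r_squared are my assumptions): regress squared OLS residuals on X and compare nR² with a χ² critical value.

```python
import numpy as np
from scipy.stats import chi2

def r_squared(X, y):
    """R^2 from an OLS regression of y on X (X includes an intercept column)."""
    beta, _, _, _ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return 1 - (resid @ resid) / ((y - y.mean()) @ (y - y.mean()))

rng = np.random.default_rng(1)
n = 200
x = rng.normal(size=n)
y = 1 + 2 * x + rng.normal(size=n) * (1 + np.abs(x))  # error variance depends on x

X = np.column_stack([np.ones(n), x])
beta, _, _, _ = np.linalg.lstsq(X, y, rcond=None)
resid_sq = (y - X @ beta) ** 2

bp_stat = n * r_squared(X, resid_sq)  # n * R^2 of squared residuals on X
crit = chi2.ppf(0.95, df=1)           # df = no. of independent variables
print(bp_stat, crit, bp_stat > crit)  # True => reject H0 (no cond. heterosk.)
```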


4.1.3 Correcting for Heteroskedasticity

Robust Standard Errors
• Corrects the standard errors of the estimated coefficients.
• Also known as heteroskedasticity-consistent standard errors or White-corrected standard errors (sketch below).

Generalized Least Squares
• Modifies the original equation.
• Requires economic expertise to implement correctly on financial data.
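A minimal sketch (assuming the statsmodels package is available; the data are toy values) of requesting White-corrected standard errors:

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
x = rng.normal(size=200)
y = 1 + 2 * x + rng.normal(size=200) * (1 + np.abs(x))  # variance grows with |x|

X = sm.add_constant(x)
fit = sm.OLS(y, X).fit(cov_type="HC1")  # heteroskedasticity-consistent errors
print(fit.bse)                          # robust standard errors of b0, b1
```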

4.2 Serial Correlation

• Regression errors are correlated across observations.
• Usually arises in time-series regressions.

4.2.1 The Consequences of Serial Correlation

• Incorrect estimates of the regression coefficient standard errors.
• Parameter estimates become inconsistent & invalid when a lagged value of Y is included among the X's under serial correlation.
• Positive serial correlation ⇒ positive (negative) errors ↑ the chance of subsequent positive (negative) errors.
• Negative serial correlation ⇒ positive (negative) errors ↑ the chance of subsequent negative (positive) errors.
• It leads to wrong inferences.
• If positive serial correlation:
  • standard errors underestimated.
  • t-statistics & F-statistics inflated.
  • Type I error ↑.
• If negative serial correlation:
  • standard errors overestimated.
  • t-statistics & F-statistics understated.
  • Type II error ↑.

4.2.2 Testing for Serial Correlation

• Variety of tests; most common → Durbin-Watson (DW) test (sketch after this list).
• DW = [Σ from t=2 to T of (εt − εt−1)²] / [Σ from t=1 to T of εt²]
  • where εt = regression residual for period t.
• For large sample sizes, the Durbin-Watson statistic is approximately
  • DW ≈ 2(1 − r),
  • where r = sample correlation b/w regression residuals from t and t−1.
• Values of DW can range from 0 to 4.
  • DW = 2 ⇒ r = 0 ⇒ no serial correlation.
  • DW = 0 ⇒ r = 1 ⇒ perfectly positively serially correlated.
  • DW = 4 ⇒ r = −1 ⇒ perfectly negatively serially correlated.
• For positive serial correlation:
  • H0 ⇒ no positive serial correlation.
  • Ha ⇒ positive serial correlation.
  • DW < dl ⇒ reject H0.
  • DW > du ⇒ do not reject H0.
  • dl ≤ DW ≤ du ⇒ inconclusive.


• For negative serial correlation:
  • H0 ⇒ no negative serial correlation.
  • Ha ⇒ negative serial correlation.
  • DW > 4 − dl ⇒ reject H0.
  • DW < 4 − du ⇒ do not reject H0.
  • 4 − du ≤ DW ≤ 4 − dl ⇒ inconclusive.
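A minimal sketch (AR(1) toy residuals assumed) computing DW from its definition and checking the DW ≈ 2(1 − r) approximation:

```python
import numpy as np

rng = np.random.default_rng(3)
e = np.empty(300)
e[0] = rng.normal()
for t in range(1, 300):               # residuals with positive serial correlation
    e[t] = 0.6 * e[t - 1] + rng.normal()

dw = np.sum(np.diff(e) ** 2) / np.sum(e ** 2)  # sum of (e_t - e_{t-1})^2 over sum of e_t^2
r = np.corrcoef(e[1:], e[:-1])[0, 1]           # lag-1 sample correlation
print(dw, 2 * (1 - r))  # both well below 2 => positive serial correlation
```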

4.2.3 Correcting for Serial Correlation

Adjust the Coefficient Standard Errors
• The recommended method.
• Hansen's method ⇒ the most prevalent one.

Modify the Regression Equation
• Extreme care is required.
• May lead to inconsistent parameter estimates.

4.3 Multicollinearity

• Occurs when two or more independent variables (X) are highly correlated with each other.
• Regression can be estimated, but results become problematic.
• A serious practical concern due to the approximate linear relations commonly found among financial variables.

4.3.1 The Consequences of Multicollinearity

• Difficulty in detecting significant relationships.
• Estimates become extremely imprecise & unreliable, though consistency is unaffected.
• F-statistic is unaffected.
• Standard errors of the regression coefficients can ↑, causing:
  • insignificant t-tests,
  • wide confidence intervals,
  • Type II error ↑.

4.3.2 Detecting Multicollinearity

• Multicollinearity is a matter of degree rather than presence/absence.
• ↑ Pairwise correlation does not necessarily indicate the presence of multicollinearity.
• ↓ Pairwise correlation does not necessarily indicate the absence of multicollinearity.
• With 2 independent variables ⇒ pairwise correlation is a useful indicator (sketch below).
• ↑ R², significant F-statistic, but insignificant t-statistics on the slope coefficients ⇒ the classic symptom of multicollinearity.
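A minimal sketch (toy data assumed) of inspecting pairwise correlations among the X's with numpy; per the caveat above, low pairwise correlations alone do not rule multicollinearity out:

```python
import numpy as np

rng = np.random.default_rng(4)
x1 = rng.normal(size=100)
x2 = 0.95 * x1 + 0.05 * rng.normal(size=100)  # nearly collinear with x1
x3 = rng.normal(size=100)

# Rows are variables; off-diagonal entries are pairwise correlations.
print(np.corrcoef(np.vstack([x1, x2, x3])))
```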

4.3.3 Correcting Multicollinearity

• Exclude one or more of the regression variables.
• In many cases, experimentation is needed to determine the variable(s) causing multicollinearity.


5. MODEL SPECIFICATION AND ERRORS IN SPECIFICATION

• Model specification ⇒ the set of variables included in the regression.
• Incorrect specification leads to biased & inconsistent parameter estimates.

5.1 Principles of Model Specification

• Model should be grounded in economic reasoning.
• Functional form of the variables should be compatible with the nature of the variables.
• Parsimonious ⇒ each included variable should play an essential role.
• Model should be examined for violations of the regression assumptions.
• Model should be tested for validity & usefulness on out-of-sample data.

5.2 Misspecified Functional Form

• One or more variables are omitted. If an omitted variable is correlated with the remaining variables, the error term will also be correlated with them, and:
  • regression coefficient estimates can be biased & inconsistent.
  • estimated standard errors of the coefficients will be inconsistent.
• One or more variables may require transformation.
• Pooling of data from different samples that should not be pooled:
  • can lead to spurious results.

5.3 Time-Series Misspecification (Independent Variables Correlated with Errors)

• Including lagged dependent variables as independent variables when the errors are serially correlated.
• Including a function of the dependent variable as an independent variable.
• Independent variables measured with error.

5.4 Other Types of Time-Series Misspecification

• Nonstationarity: a variable's properties, e.g. its mean, are not constant through time.
• In practice, nonstationarity is a serious problem.


6. MODELS WITH QUALITATIVE DEPENDENT VARIABLES

• Qualitative dependent variables ⇒ dummy variables used as the dependent variable instead of as independent variables.
• Probit model ⇒ based on the normal distribution; estimates the probability:
  • of a discrete outcome, given the values of the independent variables used to explain that outcome.
  • that Y = 1, implying a condition is met.
• Logit model:
  • Identical to the probit model, except that it is based on the logistic distribution.
• Both logit and probit models must be estimated using maximum likelihood methods (sketch below).
• Discriminant analysis ⇒ can be used to create an overall score that is used for classification.
• Qualitative dependent variable models can be used for portfolio management and business management.
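A minimal sketch (toy data; scipy's general-purpose optimizer rather than a dedicated logit routine) of estimating a one-variable logit model by maximum likelihood, where P(Y = 1) = 1 / (1 + e^−(b0 + b1·X)):

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(5)
x = rng.normal(size=500)
p_true = 1 / (1 + np.exp(-(0.5 + 1.5 * x)))  # true b0 = 0.5, b1 = 1.5
y = rng.uniform(size=500) < p_true           # simulated binary outcomes

def neg_log_likelihood(b):
    # Negative log-likelihood of the logit model at parameters b = [b0, b1].
    p = 1 / (1 + np.exp(-(b[0] + b[1] * x)))
    return -np.sum(y * np.log(p) + (~y) * np.log(1 - p))

result = minimize(neg_log_likelihood, x0=np.zeros(2))
print(result.x)  # maximum likelihood estimates, close to [0.5, 1.5]
```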

Copyright © FinQuiz.com. All rights reserved.
