Wooldridge 6e Ch05 IM
CHAPTER 5
Multiple Regression Analysis: OLS Asymptotics
Table of Contents
Teaching Notes
Solutions to Problems
Solutions to Computer Exercises
TEACHING NOTES
Chapter 5 is short, but it is conceptually more difficult than the earlier chapters, primarily
because it requires some knowledge of the asymptotic properties of estimators. In class, I give a
brief, heuristic description of consistency and asymptotic normality before stating the
consistency and asymptotic normality of OLS. (Conveniently, the same assumptions that work
for finite sample analysis work for asymptotic analysis.) More advanced students can follow the
proof of consistency of the slope coefficient in the bivariate regression case. Section E.4 contains
a full matrix treatment of asymptotic analysis appropriate for a master’s level course.
An explicit illustration of what happens to standard errors as the sample size grows emphasizes
the importance of having a larger sample. I do not usually cover the LM statistic in a first-
semester course, and I only briefly mention the asymptotic efficiency result. Without full use of
matrix algebra combined with limit theorems for vectors and matrices, it is difficult to prove
asymptotic efficiency of OLS.
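For instructors who want a concrete in-class illustration of how standard errors shrink with the sample size, the following is a minimal simulation sketch (not part of the text); the data-generating process, coefficient values, and sample sizes are invented purely for illustration.

```python
# Minimal sketch: how the OLS standard error shrinks as n grows.
# All numbers below are invented for illustration only.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
for n in (100, 400, 1600, 6400):
    x = rng.normal(size=n)
    u = rng.standard_t(df=5, size=n)       # non-normal errors are fine asymptotically
    y = 1.0 + 0.5 * x + u                  # true slope is 0.5
    fit = sm.OLS(y, sm.add_constant(x)).fit()
    print(f"n = {n:5d}   se(slope) = {fit.bse[1]:.4f}")
# Quadrupling n roughly halves se(slope), the 1/sqrt(n) rate discussed above.
```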
I think the conclusions of this chapter are important for students to know, even though they may
not fully grasp the details. On exams, I usually include true-false type questions, with
explanations, to test the students’ understanding of asymptotics. [For example: “In large
samples we do not have to worry about omitted variable bias.” (False). Or “Even if the error
term is not normally distributed, in large samples we can still compute approximately valid
confidence intervals under the Gauss-Markov assumptions.” (True).]
SOLUTIONS TO PROBLEMS
5.1 Write y = β₀ + β₁x₁ + u, and take the expected value: E(y) = β₀ + β₁E(x₁) + E(u), or µ_y = β₀ + β₁µ_x since E(u) = 0, where µ_y = E(y) and µ_x = E(x₁). We can rewrite this as β₀ = µ_y − β₁µ_x. Now, β̂₀ = ȳ − β̂₁x̄₁. Taking the plim of this we have plim(β̂₀) = plim(ȳ − β̂₁x̄₁) = plim(ȳ) − plim(β̂₁)⋅plim(x̄₁) = µ_y − β₁µ_x = β₀, where we use the facts that plim(ȳ) = µ_y and plim(x̄₁) = µ_x by the law of large numbers, and that plim(β̂₁) = β₁. We have also used the parts of Property PLIM.2 from Appendix C.
5.2 A higher tolerance of risk means more willingness to invest in the stock market, so β₂ > 0. By assumption, funds and risktol are positively correlated. Now we use equation (5.5), where δ₁ > 0: plim(β̂₁) = β₁ + β₂δ₁ > β₁, so β̂₁ has a positive inconsistency (asymptotic bias). This makes sense: if we omit risktol from the regression and it is positively correlated with funds, some of the estimated effect of funds is actually due to the effect of risktol.
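A small simulation can make the direction of the inconsistency concrete. This is a hedged sketch, not part of the solution; the coefficient values and the strength of the funds–risktol correlation are invented.

```python
# Sketch of the asymptotic bias in equation (5.5): omit risktol, which is
# positively correlated with funds and has beta_2 > 0. All numbers invented.
import numpy as np

rng = np.random.default_rng(1)
n = 200_000                                   # "large sample"
risktol = rng.normal(size=n)
funds = 0.8 * risktol + rng.normal(size=n)    # positive correlation => delta_1 > 0
y = 2.0 + 1.0 * funds + 0.5 * risktol + rng.normal(size=n)   # beta_1 = 1, beta_2 = 0.5

# Simple (short) regression slope of y on funds only:
b1_short = np.cov(funds, y)[0, 1] / np.var(funds, ddof=1)
print(b1_short)   # settles around beta_1 + beta_2*delta_1 > 1: a positive inconsistency
```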
5.3 The variable cigs has nothing close to a normal distribution in the population. Most people
do not smoke, so cigs = 0 for over half of the population. A normally distributed random
variable takes on no particular value with positive probability. Further, the distribution of cigs is
skewed, whereas a normal random variable must be symmetric about its mean.
5.4 Write y = β₀ + β₁x + u, and take the expected value: E(y) = β₀ + β₁E(x) + E(u), or µ_y = β₀ + β₁µ_x, since E(u) = 0, where µ_y = E(y) and µ_x = E(x). We can rewrite this as β₀ = µ_y − β₁µ_x. Now, β̂₀ = ȳ − β̂₁x̄. Taking the plim of this we have plim(β̂₀) = plim(ȳ − β̂₁x̄) = plim(ȳ) − plim(β̂₁)⋅plim(x̄) = µ_y − β₁µ_x = β₀, where we use the facts that plim(ȳ) = µ_y and plim(x̄) = µ_x by the law of large numbers, and that plim(β̂₁) = β₁. We have also used the parts of Property PLIM.2 from Appendix C.
5.5 (ii) By observing the histogram, we can see that only a very small proportion of the students scored less than 60. No, the normal distribution does not fit well in the left tail.
SOLUTIONS TO COMPUTER EXERCISES

C5.1 (i) Below is a histogram of the 526 residuals, ûᵢ, i = 1, 2, …, 526. The histogram uses 27 bins, as suggested by the formula in the Stata manual for 526 observations. For comparison, the normal distribution that provides the best fit to the histogram is also plotted.
[Figure: histogram of the wage-equation residuals (horizontal axis: uhat, roughly −8 to 15; vertical axis: fraction), with the best-fitting normal density overlaid.]
(ii) The estimated equation with log(wage) as the dependent variable is

log(wage) = .284 + .092 educ + .0041 exper + .022 tenure
                   (.104)  (.007)         (.0017)          (.003)

n = 526, R² = .316, σ̂ = .441.
The histogram for the residuals from this equation, with the best-fitting normal distribution
overlaid, is given below:
[Figure: histogram of the log(wage)-equation residuals (horizontal axis: uhat, roughly −2 to 1.5; vertical axis: fraction), with the best-fitting normal density overlaid.]
(iii) The residuals from the log(wage) regression appear to be more normally distributed. Certainly the histogram in part (ii) fits under its comparable normal density better than the one in part (i), and the histogram for the wage residuals is notably skewed to the right. In the wage regression, there are some very large residuals (roughly equal to 15) that lie almost five estimated standard deviations (σ̂ = 3.085) from the mean of the residuals, which is identically zero, of course. Residuals far from zero do not appear to be nearly as much of a problem in the log(wage) regression.
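For readers who want to reproduce the two histograms, here is a rough sketch. It assumes WAGE1 can be loaded through the third-party `wooldridge` package (under the name 'wage1'); substitute your own data-loading step if needed. The choice of 27 bins simply mirrors the choice reported above.

```python
# Sketch: compare residual histograms from the wage and log(wage) equations,
# each with its best-fitting normal density. Data loading is an assumption.
import numpy as np
import wooldridge                      # third-party package; pip install wooldridge
import statsmodels.formula.api as smf
import matplotlib.pyplot as plt
from scipy import stats

df = wooldridge.data('wage1')

fig, axes = plt.subplots(1, 2, figsize=(10, 4))
models = [('wage ~ educ + exper + tenure', 'wage residuals'),
          ('np.log(wage) ~ educ + exper + tenure', 'log(wage) residuals')]
for ax, (formula, title) in zip(axes, models):
    uhat = smf.ols(formula, data=df).fit().resid
    ax.hist(uhat, bins=27, density=True)
    grid = np.linspace(uhat.min(), uhat.max(), 200)
    ax.plot(grid, stats.norm.pdf(grid, loc=uhat.mean(), scale=uhat.std()))
    ax.set_title(title)
plt.show()
```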
C5.2 (iii) The ratio of the standard error using 2,070 observations to that using 4,137 observations is about 1.04. From (5.10), we compute √(4,137/2,070) ≈ 1.41, which is somewhat above the ratio of the actual standard errors but reasonably close.
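The predicted ratio from (5.10) is a one-line calculation; a quick check of the arithmetic used above:

```python
# Standard errors shrink like 1/sqrt(n), so halving the sample inflates them
# by roughly sqrt(2); with these two sample sizes:
import math
print(math.sqrt(4137 / 2070))   # about 1.41, as used above
```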
C5.3 We first run the regression of bwght on cigs, parity, and faminc using only the 1,191 observations with nonmissing values for motheduc and fatheduc. The residuals from this regression, ũᵢ, are then regressed on cigsᵢ, parityᵢ, famincᵢ, motheducᵢ, and fatheducᵢ, where, of course, we can only use the same 1,191 observations. The R-squared from this regression, R²_ũ, is about .0024. With 1,191 observations, the chi-square (LM) statistic is (1,191)(.0024) ≈ 2.86. The p-value from the χ²₂ distribution is about .239, which is very close to .242, the p-value for the comparable F test.
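A sketch of the LM (n·R-squared) procedure just described is below. It assumes BWGHT is available through the third-party `wooldridge` package as 'bwght'; the variable names follow the text, but treat the data-loading step as an assumption.

```python
# Sketch of the LM test for excluding motheduc and fatheduc from the bwght equation.
import wooldridge                      # assumed data source
import statsmodels.formula.api as smf
from scipy import stats

df = wooldridge.data('bwght').dropna(subset=['motheduc', 'fatheduc'])   # 1,191 usable rows

restricted = smf.ols('bwght ~ cigs + parity + faminc', data=df).fit()
df['utilde'] = restricted.resid        # residuals from the restricted model

# Regress the restricted residuals on ALL the explanatory variables.
aux = smf.ols('utilde ~ cigs + parity + faminc + motheduc + fatheduc', data=df).fit()

LM = aux.nobs * aux.rsquared           # n * R-squared
pval = stats.chi2.sf(LM, df=2)         # two exclusion restrictions
print(LM, pval)                        # roughly 2.86 and .24, as reported above
```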
C5.4 (i) The measure of skewness for inc is about 1.86. When we use log(inc), the skewness measure is about .360. Therefore, there is much less skewness in the log of income, which means log(inc) comes much closer to being normally distributed than inc. (In fact, the right skewness of income distributions is a well-documented fact across many countries and time periods.)
(ii) The skewness for bwght is about −.60. When we use log(bwght), the skewness measure
is about −2.95. In this case, there is much more skewness after taking the natural log.
(iii) The example in part (ii) clearly shows that this statement cannot hold generally. It is
possible to introduce skewness by taking the natural log. As an empirical matter, for many
economic variables, particularly dollar values, taking the log often does help to reduce or
eliminate skewness. But it does not have to.
(iv) For the purposes of regression analysis, we should be studying the conditional distributions: that is, the distributions of y and log(y) conditional on the explanatory variables x₁, ..., x_k. If we think the conditional mean is linear, as in Assumptions MLR.1 and MLR.4, then this is equivalent to studying the distribution of the population error, u. In fact, the skewness measure studied in this question is often applied to the residuals from an OLS regression.
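The skewness measure referred to throughout this exercise is the standardized third moment. Below is a minimal sketch of computing it; the simulated "income" series is invented purely to show the pattern, and the same function can be applied to inc, log(inc), bwght, log(bwght), or to OLS residuals as suggested in part (iv).

```python
# Sample skewness as the standardized third moment, E[((x - mean)/sd)^3].
import numpy as np

def skewness(x):
    x = np.asarray(x, dtype=float)
    z = (x - x.mean()) / x.std()
    return float(np.mean(z ** 3))

rng = np.random.default_rng(0)
inc = np.exp(rng.normal(size=5000))            # invented lognormal "income": right-skewed
print(skewness(inc), skewness(np.log(inc)))    # large positive vs. approximately zero
```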
C5.5 (i) The variable educ takes on all integer values from 6 to 20, inclusive. So it takes on 15
distinct values. It is not a continuous random variable, nor does it make sense to think of it as
approximately continuous. (Contrast a variable such as hourly wage, which is rounded to two
decimal places but takes on so many different values it makes sense to think of it as continuous.)
(ii) With a discrete variable, usually, a histogram has bars centered at each outcome, with
the height being the fraction of observations taking on the value. Such a histogram, with a
normal distribution overlay, is given below.
[Figure: histogram of educ, "highest grade completed by 1991" (values 6 to 20; vertical axis: density), with the best-fitting normal density overlaid.]
Even discounting the discreteness, the best fitting normal distribution (matching the sample
mean and variance) fits poorly. The focal point at educ = 12 clearly violates the notion of a
smooth bell-shaped density.
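Here is a sketch of how a histogram like this can be built for a discrete variable, with one bar per outcome (height equal to the sample fraction) and the matching normal density overlaid. It assumes the HTV data are available through the `wooldridge` package as 'htv'; treat the loading step as an assumption.

```python
# Discrete-outcome histogram of educ with the best-fitting normal overlaid.
# Because educ takes unit-spaced integer values, fractions are comparable to a density.
import numpy as np
import wooldridge                      # assumed data source
import matplotlib.pyplot as plt
from scipy import stats

educ = wooldridge.data('htv')['educ']

values, counts = np.unique(educ, return_counts=True)
plt.bar(values, counts / counts.sum(), width=0.9)    # fraction at each integer value

grid = np.linspace(values.min() - 1, values.max() + 1, 200)
plt.plot(grid, stats.norm.pdf(grid, loc=educ.mean(), scale=educ.std()))
plt.xlabel('highest grade completed')
plt.show()
```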
(iii) Given the findings in part (ii), the error term in the equation relating educ to motheduc, fatheduc, and abil cannot have a normal distribution independent of the explanatory variables. Thus, MLR.6 is violated. In fact, the inequality educ ≥ 0 means that u is not even free to vary over all values given motheduc, fatheduc, and abil. (It is likely that the homoskedasticity assumption fails, too, but this is less clear and does not follow from the nature of educ.)
The violation of MLR.6 means that we cannot perform exact statistical inference; we must
rely on asymptotic analysis. This in itself does not change how we perform statistical inference:
without normality, we use exactly the same methods, but we must be aware that our inference
holds only approximately.
C5.6 (i) Logically, the smallest and the largest values of score would be 0 and 100, respectively.
In the sample, the smallest and the largest values of score are 19.53 and 98.44, respectively.
(ii) The distribution of score is skewed to the left, and this violates the normality assumption even conditional on the explanatory variables. Therefore, Assumption MLR.6 does not hold for the error term u, which means that the t statistics will not have exact t distributions and the F statistics will not have exact F distributions. This is a potentially serious problem because our inference hinges on being able to obtain critical values or p-values from the t or F distributions.
(iii) The estimated equation is

score = 27.43 + 13.80 colgpa + 0.54 actmth − 0.26 acteng.

The t statistic for acteng is −2.48, and the corresponding p-value is 0.013. Because the sample size is large, the distribution of the t statistic is well approximated by a t distribution even though the error term is not normally distributed, so this p-value can be treated as approximately valid.
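A sketch of the part (iii) regression is below, assuming ECONMATH is available through the `wooldridge` package as 'econmath'; treat the data-loading step as an assumption.

```python
# Estimate score on colgpa, actmth, and acteng; with n this large, the usual
# t statistics are justified asymptotically despite the non-normal error term.
import wooldridge                      # assumed data source
import statsmodels.formula.api as smf

df = wooldridge.data('econmath')
res = smf.ols('score ~ colgpa + actmth + acteng', data=df).fit()
print(res.summary())
```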