Chap8-9_Fall20_1129
Chap8-9_Fall20_1129
(8. 1 & 8. 2)
Overview
1. What is it?
2. Consequences of heteroskedasticity (8.1)
3. Implication for computing standard errors (8.2)
4. Testing for heteroskedasticity (8.3)
What …?
If var(u|Xi) is constant – that is, if the variance of the conditional
distribution of u given X does not depend on X – then u is said to
be homoskedastic . Otherwise, u is heteroskedastic .
Heteroskedasticity
Luckily, we can adjust standard errors and t, F statistics so that they are
valid in the presence of heteroscedasticity – heteroscedasticity-robust
procedure.
Formulas for OLS standard errors and related statistics have been
developed that are robust to heteroskedasticity of unknown form.
o heteroskedasticiy-robust standard error;
o heteroskedasticiy-robust t stastistics;
o heteroskedasticiy-robust F stastistics;
In practice:
The homoskedasticity-only formula for the standard error and the
“heteroskedasticity-robust” formula differs – so in general, you get different
standard errors using the different formulas.
------------------------------------------------------------------------------
| Robust
testscr | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
str | -1.101296 .4328472 -2.54 0.011 -1.95213 -.2504616
pctel | -.6497768 .0310318 -20.94 0.000 -.710775 -.5887786
_cons | 686.0322 8.728224 78.60 0.000 668.8754 703.189
------------------------------------------------------------------------------
-------------------------------------------------------------------------
If you use the “, robust” option, STATA computes heteroskedasticity-robust standard errors
Otherwise, STATA computes homoskedasticity-only standard errors
Heteroskedasticity-robust Inference
Under MLR.4
Restricted model
(regression on constant)
2 General
�𝑆𝑆𝑆𝑆𝑅𝑅𝑟𝑟 − 𝑆𝑆𝑆𝑆𝑅𝑅𝑢𝑢𝑢𝑢 )⁄𝑞𝑞 �𝑅𝑅𝑢𝑢𝑢𝑢 − 𝑅𝑅𝑟𝑟2 )⁄𝑞𝑞
𝐹𝐹 = = 2 ⁄ ∼ 𝐹𝐹q,𝑛𝑛−𝑘𝑘−1 Derive
𝑆𝑆𝑆𝑆 𝑅𝑅𝑢𝑢𝑢𝑢 ⁄( 𝑛𝑛 − 𝑘𝑘 − 1) 1 − 𝑅𝑅𝑢𝑢𝑢𝑢 ) ( 𝑛𝑛 − 𝑘𝑘 − 1
k restrictions so q=k
a
LM ~ χ q2 , so can choose a critical
value, c, from a χ q2 distribution, or
just calculate a p - value for χ q2
8.14
If we reject the BP test, some corrective measure should be taken, for example,
use the heteroskedasticity-robust standard errors. (More on 8.4 WLS)
Testing for Heteroskedasticity
Heteroskedasticity
auxiliary regression
8.20
8.20
HW: 1, 4
More on
Chapter 9 Specification
and Data Issues
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a
certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use. © kentoh/Shutterstock.
More on Specification and Data
Issues
● Tests for functional form misspecification
• One can always test whether explanatory should appear as squares
or higher order terms by testing whether such terms can be
excluded
• Otherwise, one can use general specification tests such as RESET
Test for the exclusion of these terms. If they cannot be excluded, this is evidence for
omitted higher order terms and interactions, i.e. for misspecification of functional form.
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license
distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use.
More on Specification and Data
Issues
● Example: Housing price equation
Evidence for
misspecification
Model 2:
Define a general model that contains both models as subcases and test:
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license
distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use.
More on Specification and Data
Issues
● Using proxy variables for unobserved explanatory variables
In general, the estimates for the returns to education and experience will be biased because
one has omit the unobservable ability variable. Idea: find a proxy variable for ability which is
able to control for ability differences between individuals so that the coefficients of the other
variables will not be biased. A possible proxy for ability is the IQ score or similar test scores.
• The proxy variable is a “good” proxy for the omitted variable, i.e.
using other variables (x1,x2) in addition will not help to predict the
omitted variable p310
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license
distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use.
More on Specification and Data
Issues
● Under these assumptions, the proxy variable method works:
Plug (2)
into (1)
In this regression model, the error term is uncorrelated with all the explanatory variables. As
a consequence, all coefficients will be correctly estimated using OLS. The coefficents for the
explanatory variables x1 and x2 will be correctly identified. The coefficient for the proxy va-
riable may also be of interest (it is a multiple, δ3 of the coefficient of the omitted variable).
• Including the past crime rate will at least partly control for the many
omitted factors that also determine the crime rate in a given year
• Another way to interpret this equation is that one compares cities
which had the same crime rate last year; this avoids comparing
cities that differ very much in unobserved crime factors
Ex: Some universities are traditionally better in academics than other
universities. Inertial effects are also captured by putting in lags of y.
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license
distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use.
Example: City crime rates
Unemployment rate & expenditures on law enforcement are counterintuitive in column (1).
With lagged dep. Vars., signs of unem & log (lawexpc) are corrected. R-squared is
increased a lot. The + sign of crmrte82 shows highly correlation with current crime rate.
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license
distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use.
HW
HW: 1, 3
© 2016 Cengage Learning ® . May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license
distributed with a certain product or service or otherwise on a password-protected website or school-approved learning management system for classroom use.