2SLS Notes
2SLS Notes
1
Organization
• Introduction
• Endogeneity problem
• 2SLS
• Hands on using Stata
• Interpretation
2
Regression with Instrumental Variables
3
Regression with Instrumental Variables (cont.’)
4
Endogeneity: An Example
Standard regression: y = xb + u
no association between x and u; OLS consistent
x y
u
Endogeneity: y = xb + u
correlation between x and u; OLS inconsistent
x y
5
Endogeneity
6
Instrumental Variables Regression
The solution provided by IV methods may be viewed as:
u
The additional variable z is termed an instrument for x. In general, we
have many variables in x, and more than one x correlated with u.
In that case, we shall need at least that many variables in z.
7
Choice of instruments
8
Instrumental Variables Models
9
Instrumental Variables Terminology
10
Different Types of Instrumental Variables Estimators
Wald estimator for binary instrument:
Set of instruments:
12
Different Types of Instrumental Variables Estimators
14
Including Control Variables in an 2SLS Model
Second stage:
15
Technical Conditions Required for Model Identification
16
Instrumental Variables and Randomized Experiments
18
Results for 2SLS (Cross-section)
19
Post-estimation: Tests of Over-identifying Restriction
20
Post-estimation: First-Stage Regression
The first-stage
regression should
be significant.
21
Post-estimation: Tests of Endogeneity
Durbin-Hausman-Wu tests
22
2SLS in Cross-Section using ivreg2 in STATA
23
2SLS from ivreg2 Results: First-Stage
24
Post-estimation: First-Stage
• Sanderson-Windmeijer multivariate F test of excluded
instruments
• Under-identification test: Anderson canonical correlation
LM statistic
• Weak identification test: Cragg-Donald Wald F statistic
• Weak-instruments-robust inference: Anderson-Rubin
Wald test
25
Sanderson-Windmeijer Multivariate F-test of Excluded
Instruments
• The Sanderson-Windmeijer (SW) first-stage chi-squared and F statistics
are tests of underidentification and weak identification, respectively, of
individual endogenous regressors.
• SW Chi-squared:
• Ho: Individual endogenous regressor is under-identified.
• H1: Individual endogenous regressor is over-identified.
• SW F-statistics:
• Ho: Individual endogenous regressor is weak identification.
• H1: Individual endogenous regressor is strong identification.
26
Anderson Canonical Correlation LM Statistics
27
First-Stage Post-estimation (1)
28
Weak identification test: Cragg-Donald Wald F-statistic
H0: Equation is weakly identified
H1: Equation is strongly identified
29
Stock and Yogo (2005) Critical Values
Stock and Yogo (2005) provide critical values that depend
on:
• The number of endogeneous regressors,
• The number of instruments
• The maximum bias
• The estimation procedure (e.g. 2SLS, LIML, …)
30
Weak-instruments-robust inference:
Anderson-Rubin Wald test
31
First-Stage Post-estimation (2)
32
2SLS from ivreg2 Results
33
Post-estimation: Two-Stage Least Squares
• Under-identification test: Anderson canonical correlation
LM statistic
• Weak identification test: Cragg-Donald Wald F statistic
• Over-identification test of all instruments: Sargan
statistic
• Endogeneity test: Wu-Hausman F test
34
Sargan Test
• Sargan test is the test of over-identification for all
instruments
35
Endogeneity Test: Wu-Hausman Test
• Wu-Hausman test is the test of endogeneity
36
Post-estimation: Two-Stage Least Squares
37
An Example for Two-Stage Least Squares: Panel Data
• 64 Countries
• Year 2009-2013
• GDP equation
38
Result for First-Stage Least Squares: Panel Data
39
Post-Estimation (1) First-Stage Least Squares: Panel Data
40
Post-Estimation (2) First-Stage Least Squares: Panel Data
41
Result for Two-Stage Least Squares: Panel Data
42
Post-Estimation Two-Stage Least Squares: Panel Data
43
Q&A
44