Introduction To Panel Data UG-students
Introduction To Panel Data UG-students
econometrics
Panel Data Analysis
(PSID)
o National Longitudinal Surveys of
(GSEP)
o The British Household Panel Survey
(BHPS)
o Tanzania National Panel Data (very
recent data)
• Common feature:
o The sample of individuals is typically
o Here the constant terms vary by individual, but the slopes are the
same for all individuals
o A test for individual effects:
§ !! : !! = !! = ⋯ = !!
§ which can be tested using an F-test.
§
o The two-way within transformation removes both observed and
unobserved heterogeneity for both individual and time effects
o Chow Test
o Provides a test of the pooled
o The two-way model can also be estimated using a random effects model
by GLS
o In one-way models the fixed effects are either fixed or random. In a two-
way model the individual and time effects can be fixed or random
o i.e. we may have mixed random effects / fixed effects models
where the time effect is assumed fixed and the individual effect
random for example
o if T is small for example, one may estimate a one-way random
effects model on a set of exogenous variables and time dummies
• Fixed or Random Effects
o Fixed effects allow for arbitrary correlation between the individual
effects and the regressors
o Fixed effects cannot provide estimates of variables that are constant
over time
o The group effect can be thought of as random if we can think of the
sample as being drawn from a larger population.
o Fixed effects model appropriate when differences between
individuals may be viewed as parametric shifts in the regression
function (considered reasonable when the sample covers broadly
exhaustive sample of the population)
– Random effects more applicable when we want to draw
inferences for the whole popula)on
§ where !!" and !!" are the residuals and fitted values
respectively from the within regression
• Policy Analysis with Pooled Cross-Sections
• Difference-in-Difference Estimation
o Methodology
§ Examine the effect of some sort of treatment by
comparing the treatment group after treatment both to
the treatment group before treatment and to some other
control group.
§ Standard case: outcomes are observed for two groups
for two time periods.
• One of the groups is exposed to a treatment in
the second period but not in the first period.
• The second group is not exposed to the
treatment during either period.
§ Structure can apply to repeated cross sections or panel
data.
§ Example:
• Usually related to a so-called natural (or quasi-)
experiment, when some exogenous event – often
a change in government policy – changes the
environment in which individuals, families, firms
or cities operate.
• A state offers a tax break to firms providing employers with health
insurance.
o To estimate the impact of the bill on the percentage of firms
offering health insurance we could use data on a state that
didn’t implement such a law as a control group.
o It is not correct to just compare pre-and post-law changes in
the percentage of firms offering health insurance
o (i.e. ! = !! + !! !2 + !,
§ !ℎ!"! !2 is a dummy for period two.
o Here the coefficient estimate !! gives an estimate of the
difference in the percentage of firms offering health
insurance between periods one and two) since there could be
a trend towards more employers offering health insurance
over time.
• With repeated cross sections, let ! be the control group and !
the treatment group.
• Write
both is not possible – but use xi: reg gdp sav pop i.year i.i
variables GLS ML
constant -0.9731 (0.518) -0.7222 (0.590)
pop -1.7037 (0.076) -1.7303 (0.048)
sav 0.1860 (0.001) 0.1767 (0.000)
Wald chi2 11.54 (0.003) -
LR test - 12.01 (0.003)
Observations 95 95