Introduction To Panel Data
Introduction To Panel Data
Introduction
• Describe what panel data is and the
reasons for using it in this format
• Assess the importance of fixed and
random effects
• Examine the Hausman test, which
determines if fixed or random effects
should be used.
• Evaluate some panel data models
Panel Data
• These are Models that Combine
Cross-section and Time-Series Data
• In panel data the same cross-sectional
unit (industry, firm, country) is surveyed
over time, so we have data which is
pooled over space as well as time.
Reasons for using Panel Data
1. Panel data can take explicit account of
individual-specific heterogeneity (“individual”
here means related to the microunit)
2. By combining data in two dimensions, panel
data gives more data variation, less collinearity
and more degrees of freedom.
3. Panel data is better suited than cross-sectional
data for studying the dynamics of change. For
example it is well suited to understanding
transition behaviour – for example company
bankruptcy or merger.
4. Panel data is better at detecting and
measuring effects that cannot be observed
in either cross-section or time-series data.
5. Panel data enables the study of more
complex behavioural models – for example
the effects of technological change, or
economic cycles.
6. Panel data can minimise the effects of
aggregation bias, from aggregating firms
into broad groups.
If all the cross-sectional units have the same number of time
series observations the panel is balanced, if not it is
unbalanced.
Cross section
Time
series
x
Fixed Effects Estimation
The previous slide suggests that a better way to model the
data would be to allow each group (firm) to have its own
intercept:
This is know as the (One Way) Fixed Effects Model.
How do we estimate it?
The simplest way to allow each firm to have its own intercept
is to create a set of dummy (binary) variables, one for each
firm, and include them as regressors.
So: