Chapter 5

Panel data analysis involves measuring the same collection of individuals or objects over time. This allows researchers to address more complex problems than would be possible with only cross-sectional or time series data. There are two main approaches to modeling panel data: fixed effects models and random effects models. Fixed effects models control for time-invariant characteristics of individuals through dummy variables, while random effects models treat these characteristics as random variables. The appropriate model depends on whether the time-invariant characteristics are correlated with the independent variables.

Uploaded by

Yohaannis Baayisaa

Chapter 5

Panel Data Analysis

The Nature of Panel Data

• Panel data, also known as longitudinal data, have both time series and cross-sectional dimensions.
• They arise when we measure the same collection of people or objects over a period of time.
• Econometrically, the setup is
$$y_{it} = \alpha + \beta x_{it} + u_{it}$$
where $y_{it}$ is the dependent variable, $\alpha$ is the intercept term, $\beta$ is a $k \times 1$ vector of parameters to be estimated on the explanatory variables $x_{it}$; $t = 1, \dots, T$; $i = 1, \dots, N$.
The Advantages of Using Panel Data

There are a number of advantages from using a full panel technique when a panel of data is available.
• We can address a broader range of issues and tackle more complex problems with panel data than would be possible with pure time series or pure cross-sectional data alone.
• It is often of interest to examine how variables, or the relationships between them, change dynamically (over time).
• By structuring the model in an appropriate way, we can remove the impact of certain forms of omitted variables bias in regression results.
Fixed Effects Models

• The fixed effects model for some variable $y_{it}$ may be written
$$y_{it} = \alpha + \beta x_{it} + \mu_i + v_{it}$$
• We can think of $\mu_i$ as encapsulating all of the variables that affect $y_{it}$ cross-sectionally but do not vary over time, for example the sector that a firm operates in, a person's gender, or the country where a bank has its headquarters.
• Thus we would capture the heterogeneity that is encapsulated in $\mu_i$ by a method that allows for different intercepts for each cross-sectional unit.
• This model could be estimated using dummy variables, which would be termed the least squares dummy variable (LSDV) approach.
Fixed Effects Models (Cont’d)

• The LSDV model may be written
$$y_{it} = \beta x_{it} + \mu_1 D1_i + \mu_2 D2_i + \mu_3 D3_i + \dots + \mu_N DN_i + v_{it}$$
where $D1_i$ is a dummy variable that takes the value 1 for all observations on the first entity (e.g., the first firm) in the sample and zero otherwise, $D2_i$ is a dummy variable that takes the value 1 for all observations on the second entity (e.g., the second firm) and zero otherwise, and so on. The common intercept $\alpha$ is left out so that the $N$ dummies are not perfectly collinear with it (the dummy variable trap).
• The LSDV can be seen as just a standard regression model and therefore it can be estimated using OLS.
• The model given by the equation above has $N + k$ parameters to estimate.
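As a rough illustration of why the LSDV approach removes time-invariant heterogeneity, the sketch below uses a made-up balanced panel (three entities, four periods, one regressor) in which $y$ is generated exactly as $\mu_i + 2x$. It computes the slope from entity-demeaned data, which by the standard partialling-out result coincides with the LSDV slope, then recovers the entity intercepts and compares with pooled OLS. All names and numbers here are hypothetical and not taken from the slides.

```python
# Toy balanced panel: 3 entities observed over 4 periods (made-up numbers).
# y is generated exactly as y = mu_i + 2 * x, so the true slope is 2.
entity_effects = {"A": 1.0, "B": 5.0, "C": -3.0}
x_by_entity = {
    "A": [1.0, 2.0, 3.0, 4.0],
    "B": [2.0, 4.0, 5.0, 7.0],
    "C": [0.0, 1.0, 1.0, 3.0],
}
y_by_entity = {
    name: [entity_effects[name] + 2.0 * x for x in xs]
    for name, xs in x_by_entity.items()
}


def mean(values):
    return sum(values) / len(values)


def within_slope(x_panel, y_panel):
    """Slope from regressing entity-demeaned y on entity-demeaned x."""
    sxy = 0.0
    sxx = 0.0
    for name in x_panel:
        x_bar = mean(x_panel[name])
        y_bar = mean(y_panel[name])
        for x, y in zip(x_panel[name], y_panel[name]):
            sxy += (x - x_bar) * (y - y_bar)
            sxx += (x - x_bar) ** 2
    return sxy / sxx


def pooled_slope(x_panel, y_panel):
    """Slope from pooled OLS with a single common intercept."""
    xs = [x for name in x_panel for x in x_panel[name]]
    ys = [y for name in x_panel for y in y_panel[name]]
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx


beta_fe = within_slope(x_by_entity, y_by_entity)
beta_pooled = pooled_slope(x_by_entity, y_by_entity)

# Entity intercepts recovered as mu_i = mean(y_i) - beta * mean(x_i).
mu_hat = {
    name: mean(y_by_entity[name]) - beta_fe * mean(x_by_entity[name])
    for name in x_by_entity
}

print("fixed effects slope:", beta_fe)
print("pooled OLS slope:   ", beta_pooled)
print("entity intercepts:  ", mu_hat)
```

In this constructed example the entity effects are correlated with the regressor, so the pooled slope drifts away from 2 while the fixed effects slope does not.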
Time Fixed Effects Models

• It is also possible to have a time-fixed effects model rather than an entity-fixed effects model.
• We would use such a model where we think that the average value of $y_{it}$ changes over time but not cross-sectionally.
• Hence with time-fixed effects, the intercepts would be allowed to vary over time but would be assumed to be the same across entities at each given point in time.
Time Fixed Effects Models (Cont’d)

• We could write a time-fixed effects model as
$$y_{it} = \alpha + \beta x_{it} + \lambda_t + v_{it}$$
where $\lambda_t$ is a time-varying intercept that captures all of the variables that affect $y$ and that vary over time but are constant cross-sectionally.
• An example would be where the regulatory environment or tax rate changes part-way through a sample period.
• In such circumstances, this change of environment may well influence $y$, but in the same way for all firms.

Time Fixed Effects Models (Cont’d)

• Time-variation in the intercept terms can be allowed for in exactly the same way as with entity fixed effects. That is, a least squares dummy variable model could be estimated
$$y_{it} = \beta x_{it} + \lambda_1 D1_t + \lambda_2 D2_t + \dots + \lambda_T DT_t + v_{it}$$
where $D1_t$, for example, denotes a dummy variable that takes the value 1 for the first time period and zero elsewhere, and so on.
Time Fixed Effects Models (Cont’d)

• The only difference is that now, the dummy variables capture time variation rather than cross-sectional variation.
• Similarly, to avoid estimating a model containing all $T$ dummies, a within transformation can be conducted to subtract away the cross-sectional averages from each observation.
• Finally, it is possible to allow for both entity fixed effects and time fixed effects within the same model. Such a model would be termed a two-way error component model.
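The within transformation for the time-fixed effects case can be written out as a short worked derivation. This is a sketch in the notation used here ($\lambda_t$ for the time effect, $\mu_i$ for the entity effect); the bars denote averages over the $N$ entities at a given date, and the two-way form on the last line is the usual textbook way of combining both effects rather than something spelled out on the slide.

```latex
\begin{aligned}
\bar{y}_t &= \frac{1}{N}\sum_{i=1}^{N} y_{it}, \qquad
\bar{x}_t = \frac{1}{N}\sum_{i=1}^{N} x_{it}, \\
\bar{y}_t &= \alpha + \beta \bar{x}_t + \lambda_t + \bar{v}_t
  \quad \text{(average the time-fixed effects model over } i\text{)}, \\
y_{it} - \bar{y}_t &= \beta\,(x_{it} - \bar{x}_t) + (v_{it} - \bar{v}_t)
  \quad \text{(subtract: } \alpha \text{ and } \lambda_t \text{ drop out)}, \\
y_{it} &= \alpha + \beta x_{it} + \mu_i + \lambda_t + v_{it}
  \quad \text{(two-way model)}.
\end{aligned}
```

Because $\lambda_t$ is common to all entities at time $t$, it cancels in the third line, so no time dummies are needed to estimate $\beta$.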
The Random Effects Model

• An alternative to the fixed effects model described above is the random effects model, which is sometimes also known as the error components model.
• As with fixed effects, the random effects approach proposes different intercept terms for each entity and again these intercepts are constant over time, with the relationships between the explanatory and explained variables assumed to be the same both cross-sectionally and temporally.
The Random Effects Model (Cont’d)

• However, the difference is that under the random effects model, the intercepts for each cross-sectional unit are assumed to arise from a common intercept $\alpha$ (which is the same for all cross-sectional units and over time), plus a random variable $\epsilon_i$ that varies cross-sectionally but is constant over time.
$$y_{it} = \alpha + \beta x_{it} + \omega_{it}, \qquad \omega_{it} = \epsilon_i + v_{it}$$
• $\epsilon_i$ measures the random deviation of each entity’s intercept term from the common intercept $\alpha$.
How the Random Effects Model Works

• Unlike the fixed effects model, there are no dummy variables to capture the heterogeneity (variation) in the cross-sectional dimension.
• Instead, this occurs via the $\epsilon_i$ terms.
• Note that this framework requires the assumptions that the new cross-sectional error term, $\epsilon_i$, has zero mean, is independent of the individual observation error term $v_{it}$, has constant variance, and is independent of the explanatory variables.
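Under these assumptions the composite error $\omega_{it} = \epsilon_i + v_{it}$ is correlated across time within an entity, because every observation on entity $i$ shares the same $\epsilon_i$. A minimal worked version follows, with the added assumption (labelled here, not stated on the slide) that $v_{it}$ has constant variance $\sigma_v^2$ and is serially uncorrelated, and writing $\sigma_\epsilon^2$ for the variance of $\epsilon_i$.

```latex
\begin{aligned}
\operatorname{Var}(\omega_{it}) &= \sigma_\epsilon^2 + \sigma_v^2, \\
\operatorname{Cov}(\omega_{it}, \omega_{is}) &= \sigma_\epsilon^2 \quad (t \neq s), \\
\operatorname{Corr}(\omega_{it}, \omega_{is}) &=
  \frac{\sigma_\epsilon^2}{\sigma_\epsilon^2 + \sigma_v^2} \quad (t \neq s).
\end{aligned}
```

The correlation is the same for any pair of dates within an entity and is zero only when $\sigma_\epsilon^2 = 0$, that is, when there is no entity-level heterogeneity.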
How the Random Effects Model Works (Cont’d)

• The parameters ($\alpha$ and the $\beta$ vector) are estimated consistently but inefficiently by OLS, and the conventional formulae would have to be modified as a result of the cross-correlations between error terms for a given cross-sectional unit at different points in time.
• Instead, a generalised least squares (GLS) procedure is usually used. The transformation involved in this GLS procedure is to subtract a weighted mean of the $y_{it}$ over time (i.e. part of the mean rather than the whole mean, as was the case for fixed effects estimation).
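The "weighted mean" in the GLS transformation can be made concrete with the standard quasi-demeaning weight $\theta = 1 - \sigma_v / \sqrt{T\sigma_\epsilon^2 + \sigma_v^2}$, where $\sigma_v$ and $\sigma_\epsilon$ are the standard deviations of $v_{it}$ and $\epsilon_i$. The sketch below treats those two values as known inputs, which is an assumption made for illustration (in practice they are estimated), and the function names are hypothetical.

```python
import math


def gls_theta(sigma_v, sigma_eps, n_periods):
    """Weight on the entity mean in the random effects (GLS) transformation.

    theta = 1 - sigma_v / sqrt(T * sigma_eps**2 + sigma_v**2)
    """
    return 1.0 - sigma_v / math.sqrt(n_periods * sigma_eps**2 + sigma_v**2)


def quasi_demean(values, theta):
    """Subtract theta times the time mean from each observation of one entity."""
    time_mean = sum(values) / len(values)
    return [value - theta * time_mean for value in values]


# One entity observed for T = 3 periods (made-up numbers).
y_i = [4.0, 6.0, 8.0]

theta_pooled = gls_theta(sigma_v=1.0, sigma_eps=0.0, n_periods=3)  # no entity effect
theta_mid = gls_theta(sigma_v=1.0, sigma_eps=1.0, n_periods=3)     # partial demeaning

y_star_mid = quasi_demean(y_i, theta_mid)
y_star_full = quasi_demean(y_i, 1.0)  # theta = 1 is the fixed effects (within) case

print(theta_pooled, theta_mid)
print(y_star_mid, y_star_full)
```

With $\theta = 0$ the data are left untouched (pooled OLS), and as $\theta$ approaches 1 the transformation approaches the full demeaning used for fixed effects.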
Fixed or Random Effects?

• It is often said that the random effects model is more appropriate when the entities in the sample can be thought of as having been randomly selected from the population, but a fixed effects model is more plausible when the entities in the sample effectively constitute the entire population.
• More technically, the transformation involved in the GLS procedure under the random effects approach will not remove the explanatory variables that do not vary over time, and hence their impact can be enumerated.
Fixed or Random Effects? (Cont’d)

• Also, since there are fewer parameters to be estimated with the random effects model (no dummy variables or within transform to perform), and therefore degrees of freedom are saved, the random effects model should produce more efficient estimation than the fixed effects approach.
• However, the random effects approach has a major drawback which arises from the fact that it is valid only when the composite error term $\omega_{it}$ is uncorrelated with all of the explanatory variables.
Fixed or Random Effects? (Cont’d)

• This assumption is more stringent than the corresponding one in the fixed effects case, because with random effects we thus require both $\epsilon_i$ and $v_{it}$ to be independent of all of the $x_{it}$.
• This can also be viewed as a consideration of whether any unobserved omitted variables (that were allowed for by having different intercepts for each entity) are uncorrelated with the included explanatory variables. If they are uncorrelated, a random effects approach can be used; otherwise the fixed effects model is preferable.
Fixed or Random Effects? (Cont’d)

• A test for whether this assumption is valid for the random effects estimator is based on a slightly more complex version of the Hausman test.
• If the assumption does not hold, the parameter estimates will be biased and inconsistent.
• To see how this arises, suppose that we have only one explanatory variable, $x_{2it}$, that varies positively with $y_{it}$, and also with the error term, $\omega_{it}$. The estimator will ascribe all of any increase in $y$ to $x$ when in reality some of it arises from the error term, resulting in biased coefficients.
Fixed or Random Effects? (Cont’d)

• If the regressors are correlated with the individual effects $u_i$, the FE estimator is consistent but the RE estimator is not consistent.
• If the regressors are uncorrelated with the $u_i$, the FE estimator is still consistent, albeit inefficient, whereas the RE estimator is consistent and efficient.
Fixed or Random Effects? (Cont’d)

• Step 1: run a fixed effects model
    xtreg lnfdi lngdphome lngdphost, fe
    estimates store fe
• Step 2: run a random effects model
    xtreg lnfdi lngdphome lngdphost, re
    estimates store ran
• Step 3: conduct the Hausman test
    hausman fe ran
• Step 4: decide which specification to use. If the corresponding probability is < 0.05, the Hausman test’s null hypothesis that the RE estimator is consistent is soundly rejected.
• In that case the individual effects do appear to be correlated with the regressors.
Fixed or Random Effects? (Cont’d)

• Step 1: run a fixed effects model
    xtreg trade_openess lnarea landlocked lnpop lngdp_pc lntot, fe
    estimates store fix
• Step 2: run a random effects model
    xtreg trade_openess lnarea landlocked lnpop lngdp_pc lntot, re
    estimates store ran
• Step 3: conduct the Hausman test
    hausman fix ran
• Step 4: decide which specification to use. If the corresponding probability is < 0.05, the Hausman test’s null hypothesis that the RE estimator is consistent is rejected, i.e., the individual effects are correlated with the regressors.
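Fixed or Random Effects? (Cont’d)

Using the macro data, run the following Hausman test:

    * rename the exporter variable and order the data by unit and year
    rename exporter country
    sort country year
    sort id year
    * declare the panel structure: unit identifier id, time variable year
    tsset id year
    * fixed effects estimates, stored under the name fix
    xtreg trade_openess lnarea landlocked lnpop lngdp_pc lntot, fe
    estimates store fix
    * random effects estimates, stored under the name ran
    xtreg trade_openess lnarea landlocked lnpop lngdp_pc lntot, re
    estimates store ran
    hausman fix ran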
Fixed or Random Effects? (Cont’d)

                      ---- Coefficients ----
                 (b)           (B)           (b-B)      sqrt(diag(V_b-V_B))
                 fix           ran         Difference          S.E.
    lnarea    -.1430533     -.1846615       .0416082         .8741601
    lnpop      .6770179      .0990702       .5779477         .0751476
    lngdp_pc   .2283795      .2091023       .0192772         .037618
    lntot     -.0824428      .0548649      -.1373078         .014989

    b = consistent under Ho and Ha; obtained from xtreg
    B = inconsistent under Ha, efficient under Ho; obtained from xtreg

    Test: Ho: difference in coefficients not systematic

    chi2(4) = (b-B)'[(V_b-V_B)^(-1)](b-B)
            = 75.51
    Prob>chi2 = 0.0000
    (V_b-V_B is not positive definite)

• Conclusion: the Hausman test’s null hypothesis that the RE estimator is consistent is rejected, i.e., the country fixed effects do appear to be correlated with the regressors. We therefore apply the fixed effects model.
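The decision step can be checked by hand from the reported statistic. The sketch below recomputes the upper-tail probability for chi2(4) = 75.51 using the closed form of the chi-squared survival function for even degrees of freedom, then applies the 0.05 rule from Step 4. The function name is hypothetical and this is only an illustration of the arithmetic, not a replacement for the `hausman` output.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-squared variable with even df.

    For df = 2m: P(X > x) = exp(-x/2) * sum_{j=0}^{m-1} (x/2)**j / j!
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form only covers positive even df")
    half = x / 2.0
    total = sum(half**j / math.factorial(j) for j in range(df // 2))
    return math.exp(-half) * total


hausman_stat = 75.51   # chi2(4) value reported in the output above
degrees_of_freedom = 4

p_value = chi2_sf_even_df(hausman_stat, degrees_of_freedom)
use_fixed_effects = p_value < 0.05  # reject H0 that RE is consistent

print("p-value:", p_value)
print("use fixed effects:", use_fixed_effects)
```

The p-value is far below 0.05, in line with the `Prob>chi2 = 0.0000` reported by Stata, so the rule points to the fixed effects specification.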
Dynamic Models

• All of the models we have considered so far have been static, e.g.
$$y_t = \beta_1 + \beta_2 x_{2t} + \dots + \beta_k x_{kt} + u_t$$
• But we can easily extend this analysis to the case where the current value of $y_t$ depends on previous values of $y$ or one of the $x$’s, e.g.
$$y_t = \beta_1 + \beta_2 x_{2t} + \dots + \beta_k x_{kt} + \gamma_1 y_{t-1} + \gamma_2 x_{2t-1} + \dots + \gamma_k x_{kt-1} + u_t$$
• We could extend the model even further by adding extra lags, e.g. $x_{2t-2}$, $y_{t-3}$.
Why Might We Want/Need to Include Lags in a Regression?

• Inertia of the dependent variable
• Over-reactions
• However, other problems with the regression could cause the null hypothesis of no autocorrelation to be rejected:
    - Omission of relevant variables, which are themselves autocorrelated.
    - A “misspecification” error committed by using an inappropriate functional form.
    - Autocorrelation resulting from unparameterised seasonality.
Models in First Difference Form

• Another way to sometimes deal with the problem of autocorrelation is to switch to a model in first differences.
• Denote the first difference of $y_t$, i.e. $y_t - y_{t-1}$, as $\Delta y_t$; similarly for the $x$-variables, $\Delta x_{2t} = x_{2t} - x_{2t-1}$ etc.
• The model would now be
$$\Delta y_t = \beta_1 + \beta_2 \Delta x_{2t} + \dots + \beta_k \Delta x_{kt} + u_t$$
• Sometimes the change in $y$ is purported to depend on previous values of $y$ or $x_t$ as well as changes in $x$:
$$\Delta y_t = \beta_1 + \beta_2 \Delta x_{2t} + \beta_3 x_{2t-1} + \beta_4 y_{t-1} + u_t$$
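To make the bookkeeping behind the last equation concrete, the sketch below builds the aligned observations $(\Delta y_t, \Delta x_{2t}, x_{2t-1}, y_{t-1})$ from two short made-up series. The helper names are hypothetical; the point is only that differencing and lagging each cost one observation, so a sample of length $T$ leaves $T - 1$ usable rows.

```python
def first_difference(series):
    """Return [s_1 - s_0, s_2 - s_1, ...]; one element shorter than the input."""
    return [curr - prev for prev, curr in zip(series, series[1:])]


def lagged_design_rows(y, x):
    """Rows (dy_t, dx_t, x_{t-1}, y_{t-1}) for t = 1, ..., T-1."""
    if len(y) != len(x):
        raise ValueError("y and x must have the same length")
    dy = first_difference(y)
    dx = first_difference(x)
    # dy[t - 1] holds y_t - y_{t-1}, so it lines up with the lagged levels at t - 1.
    return [(dy[t - 1], dx[t - 1], x[t - 1], y[t - 1]) for t in range(1, len(y))]


# Made-up series with T = 4 observations.
y = [10.0, 12.0, 15.0, 14.0]
x = [1.0, 2.0, 4.0, 3.0]

rows = lagged_design_rows(y, x)
for row in rows:
    print(row)
```

Each row could then be passed to any OLS routine to estimate $\beta_1, \dots, \beta_4$.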
