slides-6-iu

The document discusses panel data models, including pooled OLS, fixed effects, and random effects models, along with their advantages and disadvantages. It highlights the importance of the Hausman test to determine the consistency of estimates between fixed and random effects models. Additionally, it provides examples of panel data using provincial data from Vietnam and outlines model specifications and estimation methods.

Uploaded by

Ngô Trâm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

slides-6-iu

Uploaded by

Ngô Trâm

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 38

PANEL DATA MODELS

Trương Đăng Thụy

[email protected]
1 - Panel data
2 - Pooled OLS estimator
3 - Fixed effects model
4 - Random effects model
5 - FE vs RE: Hausman test
6 - Between group estimator

COVERED IN THIS LECTURE

PANEL DATA
PANEL DATA
▪ cross-sectional: MANY units and ONE time period
𝑦𝑖 with 𝑖 = 1, … 𝑁
▪ time-series: ONE unit and SEVERAL time periods
𝑦𝑡 with 𝑡 = 1, … , 𝑇
▪ panel data: data on MANY units and SEVERAL time periods
𝑦𝑖𝑡 with 𝑖 = 1, … , 𝑁 and 𝑡 = 1, . . , 𝑇
EXAMPLE DATA
Viet Nam Provincial data on
▪ rgdp: provincial GDP (bil. VND)
▪ labfo: number of laborers of provinces (1000 persons)
▪ rinvest: gross investment of provinces (bil. VND)
▪ pci: 100-point scaled composite index measuring and ranking Vietnam’s provinces
based on their overall economic governance quality
▪ data for 58 provinces, 5 years (2007-2011)
EXAMPLE PANEL DATA
provcode province year rgdp labfo rinvest pci
An Giang 1 2007 22000000 1221.3 5600000 66.4688
1 2008 25000000 1244.9 4600000 61.1247
1 2009 25000000 1227.3 4800000 58.177
1 2010 27000000 1255 4500000 61.9379
1 2011 29000000 1300.4 3900000 62.22
Bac Can 2 2007 1500000 177.2 592714 46.4687
2 2008 2000000 179.8 1100000 39.7762
2 2009 2400000 189.8 1100000 75.9563
2 2010 3400000 194 2600000 51.4864
2 2011 4200000 199.6 2900000 52.71
... ... ... ... ... ... ...
ADVANTAGES AND DISADVANTAGES
OF PANEL DATA
▪ Advantages
▪ More observations
▪ More variability
▪ Less collinearity between regressors
▪ Control of individual heterogeneity
▪ Reduce biases

▪ Disadvantages
▪ Require more efforts collecting data
▪ Selectivity biases
PANEL DATA MODEL
REQUIRES WITHIN GROUP VARIATION
▪ Panel data model (FE) requires variation within group
▪ An example where panel data does not work
𝑦𝑖𝑡 = 𝛼 + 𝛽𝑥𝑖𝑡 + 𝑢
▪ 𝑦𝑖𝑡 is export volume from VN to country 𝑖 in year 𝑡
▪ 𝑥𝑖𝑡 is the distance from VN to country 𝑖 in year 𝑡
▪ As distance from VN to country 𝑖 does not change from year to year, it can’t be
included in the fixed effect model.
THE DATA
▪ Provincial data 2007 – 2011
▪ rgdp: regional GDP (mil. VND)
▪ rinvest: investment (mil. VND)
▪ labfo: labor force (thousand workers)
▪ pci: Provicial Competitive Index (range 0-100)

▪ Source: https://kinhteluong.online/esdata/iu/panel.csv
SUMMARY STATISTICS
MODEL SPECIFICATION
▪ In this lecture we will consider the specification

𝑦𝑖𝑡 = 𝛼 + 𝛽𝑋𝑖𝑡 + 𝑢
▪ 𝑦𝑖𝑡 is the logarithm of real GDP of province 𝑖 in year 𝑡
▪ 𝑋𝑖𝑡 includes
▪ Logarithm of the labor force
▪ Logarithm of real investment
▪ Provincial competitiveness index (PCI)
POOLED OLS ESTIMATOR
POOLED OLS ESTIMATOR
▪ Data of all groups are pooled together
▪ No difference between groups

𝑦𝑖𝑡 = 𝛼 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡

▪ Coefficients are identical for all groups.
▪ Some assumptions:
▪ The error term is not autocorrelated and homoscedastic
▪ 𝑋 is nonstochastic and not correlate with 𝑢 (𝑋 is strictly exogenous)
THE POOLED OLS IN R
POOLED OLS WITH ROBUST STANDARD ERRORS
CLUSTERED STANDARD ERRORS
▪ The Pooled OLS estimator (and other panel data models) assumes no correlation
between residuals of the same group
▪ If we relax the assumption, then

cov 𝑢𝑖𝑡 , 𝑢𝑖𝑠 ≠ 0

▪ We then have heteroskedasticity and autocorrelation
▪ If this happens, the Pooled OLS estimator is still consistent, but the standard errors
are incorrect.
▪ In this case we may use the clustered robust standard errors.
POOLED OLS WITH CLUSTERED STANDARD ERRORS
POOLED OLS
USING PACKAGE PLM
FIXED EFFECTS MODEL

Within group estimator

THE FIXED EFFECTS MODEL
▪ The model

𝑦𝑖𝑡 = 𝛼𝑖 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡

▪ The slopes are still identical for all groups.
▪ But each group has a different intercept.
▪ These intercepts are called fixed effects, which capture individual heterogeneity.
▪ Two estimators:
▪ Fixed effects estimator (within group)
▪ Least square dummy variable estimator

▪ Note: these are the two ways of estimating the FE model, not two different models.
WITHIN GROUP FIXED EFFECTS ESTIMATOR
▪ The model
𝑦𝑖𝑡 = 𝛼𝑖 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡 (1)
▪ We need to allow for the intercept to vary across groups.
▪ Now take the average of variables across time, note that the parameters are time-
invariant
𝑦ത𝑖𝑡 = 𝛼𝑖 + 𝛽 𝑋ത𝑖𝑡 + 𝑢ത 𝑖𝑡 (2)
1 1
▪ where 𝑦ത𝑖𝑡 = σ𝑇𝑡=1 𝑦𝑖𝑡 and 𝑋ത𝑖𝑡 = σ𝑇𝑡=1 𝑋𝑖𝑡
𝑇 𝑇
▪ Then subtract (2) from (1)
𝑦𝑖𝑡 − 𝑦ത𝑖𝑡 = 𝛼𝑖 − 𝛼𝑖 + 𝛽 𝑋𝑖𝑡 − 𝑋ത𝑖𝑡 + 𝑢𝑖𝑡 − 𝑢ത 𝑖𝑡
▪ Which results in
𝑦ු𝑖𝑡 = 𝛽 𝑋ෘ𝑖𝑡 + 𝑢ු 𝑖𝑡
▪ With this way we can estimate 𝛽 but not the fixed effects.
WITHIN GROUP
FIXED EFFECTS
ESTIMATOR
WITHIN GROUP
FIXED EFFECTS
ESTIMATOR
robust standard
errors
WITHIN GROUP
FIXED EFFECTS
ESTIMATOR
clustered standard
errors
LEAST SQUARES DUMMY VARIABLE ESTIMATOR
▪ For the model
𝑦𝑖𝑡 = 𝛼𝑖 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡
▪ we can estimate the fixed effects and 𝛽 by introducing the dummy variables
1 if 𝑗 = 𝑖
𝐷𝑗𝑖 =
0 otherwise
▪ We can then estimate the following model using OLS
𝑁

𝑦𝑖𝑡 = ෍ 𝛼𝑗 𝐷𝑗𝑖 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡

𝑗=1
▪ This is the least squares dummy variable (LSDV) estimator.
▪ The LSDV estimates are identical to the within group FE estimates.
▪ However, LSDV estimate the fixed effects.
▪ On the other hand, LSDV is not feasible when 𝑁 is large.
LSDV
FIXED EFFECTS
ESTIMATOR

some factors omitted

LSDV
FIXED EFFECTS
ESTIMATOR
robust standard
errors
LSDV
FIXED EFFECTS
ESTIMATOR
clustered standard
errors
LSDV TWO-WAY FIXED EFFECTS MODEL
▪ The model now includes time fixed effects

𝑁 𝑇

𝑦𝑖𝑡 = ෍ 𝛼𝑗 𝐷𝑗𝑖 + ෍ 𝛾𝑔 𝐷𝑔𝑡 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡

𝑗=1 𝑔=1

where
1 if 𝑔 = 𝑡
𝐷𝑔𝑡 =
0 otherwise
LSDV TWO-WAY
FIXED EFFECTS MODEL

some factors omitted

RANDOM EFFECTS MODEL
▪ The random effects model is presented by
𝑦𝑖𝑡 = 𝛼 + 𝛽𝑋𝑖𝑡 + 𝑢𝑖𝑡
▪ The error component now includes
𝑢𝑖𝑡 = 𝜇𝑖 + 𝜖𝑖𝑡
▪ 𝜇𝑖 ~𝑁 0, 𝜎𝜇2 the individual specific random component
▪ 𝜖𝑖𝑡 ~𝑁 0, 𝜎𝜖2 the idiosyncratic disturbance
▪ In the random effects model, regressors can be time-invariant
▪ Estimation method: generalized least squares
RANDOM EFFECTS MODEL
RANDOM EFFECT
MODEL
clustered
standard errors
RANDOM VS. FIXED EFFECTS
RANDOM VS. FIXED EFFECTS
▪ The main difference is that the individual effects are assumed:
▪ fixed in FE
▪ random in RE.

▪ The random effects model is preferred for

▪ The fixed effects vary over time.
▪ It is more efficient (higher degree of freedom)
▪ It allows time-invariant regressors

▪ RE estimates, however, are inconsistent if assumption (error term is not correlated

with individual effects) is violated
▪ Which one should we use? Hausman test!
HAUSMAN TEST
▪ Null hypothesis:
▪ Estimates of RE and FE are not systematically different, or
▪ both RE and FE estimates are consistent

▪ Alternative hypothesis: RE estimates are inconsistent

▪ Test statistics
′ −1
𝐻 = 𝛽𝐹𝐸 − 𝛽𝑅𝐸 𝑉 𝛽𝐹𝐸 − 𝑉 𝛽𝑅𝐸 𝛽𝐹𝐸 − 𝛽𝑅𝐸
▪ which follows 𝜒 2 with df = number of regressors. Note that 𝑉(𝛽) is the variance
covariance matrix.
▪ We reject H0 if p-value is small.
▪ If reject H0: estimates of RE and FE are different, and so RE estimates are inconsistent.
▪ If not reject H0: RE and FE estimates are not different, so both are good. But remember
that RE estimates are more efficient.
HAUSMAN TEST IN R
NOTES ON HAUSMAN TEST
▪ We can test only with the same set of regressors.
▪ If we include time-invariant regressor in the RE model (which is not possible in FE
model), then Hausman test fails.
▪ Hausman test check whether the two estimates are equal.
▪ If we reject the null hypothesis, the FE estimates are consistent and the RE model is
not.
▪ Important: if any regressor is correlated with the error term, both estimates are
biased.

Fixed Effects, Random Effects Model Cheat Sheet
100% (1)
Fixed Effects, Random Effects Model Cheat Sheet
4 pages
Panel Data Lecture Notes
No ratings yet
Panel Data Lecture Notes
38 pages
QWE Case Study
No ratings yet
QWE Case Study
5 pages
Lecture 14 - Panel data models
No ratings yet
Lecture 14 - Panel data models
40 pages
Fere
No ratings yet
Fere
46 pages
Topic 9: Panel Data Models
No ratings yet
Topic 9: Panel Data Models
46 pages
Panel Data Assign
No ratings yet
Panel Data Assign
19 pages
Some Basics For Panel Data Analysis
No ratings yet
Some Basics For Panel Data Analysis
21 pages
Panel Data Models
No ratings yet
Panel Data Models
25 pages
Topic 6 - Static Panel Data
No ratings yet
Topic 6 - Static Panel Data
21 pages
Fem & Rem
No ratings yet
Fem & Rem
20 pages
Introduction To Panel Data
No ratings yet
Introduction To Panel Data
20 pages
1669594424_72__UE_panelv3
No ratings yet
1669594424_72__UE_panelv3
35 pages
Fixed and Random Effects: Jos Elkink
No ratings yet
Fixed and Random Effects: Jos Elkink
121 pages
Intro Panel Data by Kurt-Univ Basel
No ratings yet
Intro Panel Data by Kurt-Univ Basel
8 pages
Panel Data: Fixed and Random Effects: I1 0 I1 0 I I1
No ratings yet
Panel Data: Fixed and Random Effects: I1 0 I1 0 I I1
8 pages
1170_10045_136696 (2)
No ratings yet
1170_10045_136696 (2)
61 pages
2025 Static Panels
No ratings yet
2025 Static Panels
19 pages
Chapter 2_Panel Data Regression
No ratings yet
Chapter 2_Panel Data Regression
30 pages
Lecture Series 1 Linear Random and Fixed Effect Models and Their (Less) Recent Extensions
No ratings yet
Lecture Series 1 Linear Random and Fixed Effect Models and Their (Less) Recent Extensions
62 pages
panel2up
No ratings yet
panel2up
9 pages
Panel Data
100% (1)
Panel Data
13 pages
6 panelmf
No ratings yet
6 panelmf
18 pages
C6 - English
No ratings yet
C6 - English
18 pages
PLM
No ratings yet
PLM
51 pages
Panel Data Analysis
No ratings yet
Panel Data Analysis
61 pages
Lectute 2 - Panel Data Regression
No ratings yet
Lectute 2 - Panel Data Regression
30 pages
Chapter 2 Slides Handout
No ratings yet
Chapter 2 Slides Handout
48 pages
14 Panel Data Models
No ratings yet
14 Panel Data Models
31 pages
Chapter 5
No ratings yet
Chapter 5
25 pages
Week 3-1
No ratings yet
Week 3-1
25 pages
Plm
No ratings yet
Plm
51 pages
AE 2023 Lecture10
No ratings yet
AE 2023 Lecture10
40 pages
Topic 4 Panel Regression Model Wble
No ratings yet
Topic 4 Panel Regression Model Wble
34 pages
Panel Data For Learing
100% (2)
Panel Data For Learing
34 pages
Panel Cookbook
No ratings yet
Panel Cookbook
98 pages
Croissant y Millo, Panel Data Econometrics
100% (1)
Croissant y Millo, Panel Data Econometrics
52 pages
Block 3
No ratings yet
Block 3
105 pages
Panel Data Econometrics in R: The PLM Package: Yves Croissant Giovanni Millo
No ratings yet
Panel Data Econometrics in R: The PLM Package: Yves Croissant Giovanni Millo
51 pages
Ch11_slides_PA April 2024 (2)
No ratings yet
Ch11_slides_PA April 2024 (2)
27 pages
Panel Data I
No ratings yet
Panel Data I
40 pages
Summary-Econometric Analysis of Panel Data-Summary
No ratings yet
Summary-Econometric Analysis of Panel Data-Summary
12 pages
Week 2
No ratings yet
Week 2
61 pages
ECMT6007/ECON4954: Panel Data Econometrics: Lecture 3: Pooled OLS, LSDV and FD Estimators
No ratings yet
ECMT6007/ECON4954: Panel Data Econometrics: Lecture 3: Pooled OLS, LSDV and FD Estimators
48 pages
Panel Ecmiic2
No ratings yet
Panel Ecmiic2
57 pages
Advanced Econometrics: Based On The Textbook by Verbeek: A Guide To Modern Econometrics
No ratings yet
Advanced Econometrics: Based On The Textbook by Verbeek: A Guide To Modern Econometrics
24 pages
Econometric Methods For Panel Data
No ratings yet
Econometric Methods For Panel Data
58 pages
Note On Panel Data
No ratings yet
Note On Panel Data
19 pages
Panal Data Method ch14 PDF
No ratings yet
Panal Data Method ch14 PDF
38 pages
Week 1
No ratings yet
Week 1
48 pages
Panel Data Model
No ratings yet
Panel Data Model
18 pages
Lecture 11
No ratings yet
Lecture 11
4 pages
Panel Data Analysi
No ratings yet
Panel Data Analysi
27 pages
Econometris II - 4
No ratings yet
Econometris II - 4
26 pages
Econometrics - Review Sheet ' (Main Concepts)
No ratings yet
Econometrics - Review Sheet ' (Main Concepts)
5 pages
LN 13
No ratings yet
LN 13
8 pages
Clustering in The Linear Model
No ratings yet
Clustering in The Linear Model
11 pages
Panel Data Lecture Rome
No ratings yet
Panel Data Lecture Rome
47 pages
Gre Formula Book
From Everand
Gre Formula Book
Saifuddin Kamran
No ratings yet
GCSE Maths Revision: Cheeky Revision Shortcuts
From Everand
GCSE Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (2)
A Conversation About Calculus
From Everand
A Conversation About Calculus
Ginachukwu Amah
No ratings yet
How To Choose The Right Statistical Tool in Prism
No ratings yet
How To Choose The Right Statistical Tool in Prism
3 pages
MF821 Syllabus
No ratings yet
MF821 Syllabus
5 pages
GSEMModellingusingStata PDF
No ratings yet
GSEMModellingusingStata PDF
97 pages
School of Business and Economics Department of Economics: Syed - Ehsan@northsouth - Edu
No ratings yet
School of Business and Economics Department of Economics: Syed - Ehsan@northsouth - Edu
5 pages
SDSC3006_Assignment 1
No ratings yet
SDSC3006_Assignment 1
2 pages
Levine Bsfc7ge Ch12 1
No ratings yet
Levine Bsfc7ge Ch12 1
93 pages
Regression Solved Example
No ratings yet
Regression Solved Example
3 pages
Causes and Consequences of The Protestant Reformation: Sascha O. Becker Steven Pfaff
No ratings yet
Causes and Consequences of The Protestant Reformation: Sascha O. Becker Steven Pfaff
52 pages
11 Classical Time Series Forecasting Methods in Python (Cheat Sheet)
No ratings yet
11 Classical Time Series Forecasting Methods in Python (Cheat Sheet)
5 pages
Econometrics Chapter Two
No ratings yet
Econometrics Chapter Two
108 pages
REVIEW OF CLRMs
No ratings yet
REVIEW OF CLRMs
53 pages
Estimation of Parameters
No ratings yet
Estimation of Parameters
2 pages
Homework 9 QMB 3200
No ratings yet
Homework 9 QMB 3200
22 pages
Regression
No ratings yet
Regression
1 page
MA Economics CBCS 2023 24 With Objectives
No ratings yet
MA Economics CBCS 2023 24 With Objectives
34 pages
Confidence Interval For Printing
No ratings yet
Confidence Interval For Printing
6 pages
Introduction To Econometrics - Stock & Watson - CH 10 Slides
No ratings yet
Introduction To Econometrics - Stock & Watson - CH 10 Slides
99 pages
Introduction To Regression With Statsmodels in Python
No ratings yet
Introduction To Regression With Statsmodels in Python
142 pages
Detecting and Resolving Model Specification Errors in STATA
No ratings yet
Detecting and Resolving Model Specification Errors in STATA
7 pages
11-Simple Linear Regression
No ratings yet
11-Simple Linear Regression
25 pages
Granger Causality Test
100% (1)
Granger Causality Test
3 pages
PSM in Stata
No ratings yet
PSM in Stata
64 pages
Chap 1,2,3,5,6 (QA) Upload
No ratings yet
Chap 1,2,3,5,6 (QA) Upload
6 pages
Econometrics: Problem Set 3: Professor: Mauricio Sarrias
No ratings yet
Econometrics: Problem Set 3: Professor: Mauricio Sarrias
5 pages
Amos 1
No ratings yet
Amos 1
67 pages
Uji Stasioner: - Import Excel "C:/Users/Acer/Documents/data1.xlsx", Sheet ("Sheet1") Cellrange (A1:J32) Firstrow
No ratings yet
Uji Stasioner: - Import Excel "C:/Users/Acer/Documents/data1.xlsx", Sheet ("Sheet1") Cellrange (A1:J32) Firstrow
3 pages
Functional Forms of Regression
No ratings yet
Functional Forms of Regression
11 pages
Applied Robust Statistics
No ratings yet
Applied Robust Statistics
532 pages
Chapter 5 Testing For Linear Restrictions and Structural Change
No ratings yet
Chapter 5 Testing For Linear Restrictions and Structural Change
7 pages