Estimation of Nonstationary Heterogeneous Panels
Estimation of Nonstationary Heterogeneous Panels
Editor
H. Joseph Newton
Department of Statistics
Texas A & M University
College Station, Texas 77843
979-845-3142; FAX 979-845-3144
[email protected]
Editor
Nicholas J. Cox
Department of Geography
Durham University
South Road
Durham City DH1 3LE UK
[email protected]
Associate Editors
Christopher F. Baum
Boston College
Rino Bellocco
Karolinska Institutet, Sweden and
Univ. degli Studi di Milano-Bicocca, Italy
A. Colin Cameron
University of CaliforniaDavis
David Clayton
Cambridge Inst. for Medical Research
Mario A. Cleves
Univ. of Arkansas for Medical Sciences
William D. Dupont
Vanderbilt University
Charles Franklin
University of WisconsinMadison
Joanne M. Garrett
University of North Carolina
Allan Gregory
Queens University
James Hardin
University of South Carolina
Ben Jann
ETH Z
urich, Switzerland
Stephen Jenkins
University of Essex
Ulrich Kohler
WZB, Berlin
J. Scott Long
Indiana University
Thomas Lumley
University of WashingtonSeattle
Roger Newson
Imperial College, London
Marcello Pagano
Harvard School of Public Health
Sophia Rabe-Hesketh
University of CaliforniaBerkeley
J. Patrick Royston
MRC Clinical Trials Unit, London
Philip Ryan
University of Adelaide
Mark E. Schaffer
Heriot-Watt University, Edinburgh
Jeroen Weesie
Utrecht University
Nicholas J. G. Winter
University of Virginia
Jeffrey Wooldridge
Michigan State University
Lisa Gilmore
Gabe Waggoner
Jens Lauritsen
Odense University Hospital
Stanley Lemeshow
Ohio State University
Copyright Statement: The Stata Journal and the contents of the supporting files (programs, datasets, and
c by StataCorp LP. The contents of the supporting files (programs, datasets, and
help files) are copyright
help files) may be copied or reproduced by any means whatsoever, in whole or in part, as long as any copy
or reproduction includes attribution to both (1) the author and (2) the Stata Journal.
The articles appearing in the Stata Journal may be copied or reproduced as printed copies, in whole or in part,
as long as any copy or reproduction includes attribution to both (1) the author and (2) the Stata Journal.
Written permission must be obtained from StataCorp if you wish to make electronic copies of the insertions.
This precludes placing electronic copies of the Stata Journal, in whole or in part, on publicly accessible web
sites, fileservers, or other locations where the copy may be accessed by anyone other than the subscriber.
Users of any of the software, ideas, data, or other materials published in the Stata Journal or the supporting
files understand that such use is made without warranty of any kind, by either the Stata Journal, the author,
or StataCorp. In particular, there is no warranty of fitness of purpose or merchantability, nor for special,
incidental, or consequential damages such as loss of profits. The purpose of the Stata Journal is to promote
free communication among Stata users.
The Stata Journal, electronic version (ISSN 1536-8734) is a publication of Stata Press. Stata and Mata are
registered trademarks of StataCorp LP.
Mark W. Frank
Sam Houston State University
Huntsville, TX
[email protected]
Abstract. We introduce a new Stata command, xtpmg, for estimating nonstationary heterogeneous panels in which the number of groups and number of time-series
observations are both large. Based on recent advances in the nonstationary panel
literature, xtpmg provides three alternative estimators: a traditional fixed-effects
estimator, the mean-group estimator of Pesaran and Smith (Estimating long-run
relationships from dynamic heterogeneous panels, Journal of Econometrics 68:
79113), and the pooled mean-group estimator of Pesaran, Shin, and Smith (Estimating long-run relationships in dynamic heterogeneous panels, DAE Working
Papers Amalgamated Series 9721; Pooled mean group estimation of dynamic heterogeneous panels, Journal of the American Statistical Association 94: 621634).
Keywords: st0125, xtpmg, nonstationary panels, heterogeneous dynamic panels,
pooled mean-group estimator, mean-group estimator, panel cointegration
Introduction
In recent years, the dynamic panel-data literature has begun to focus on panels in
which the number of cross-sectional observations (N ) and the number of time-series
observations (T ) are both large. The availability of data with greater frequency is
certainly a key contributor to this shift. Some cross-national and cross-state datasets,
for example, are now large enough in T such that each nation (or state) can be estimated
separately.
The asymptotics of large N , large T dynamic panels are different from the asymptotics of traditional large N , small T dynamic panels. Small T panel estimation usually relies on fixed- or random-effects estimators, or a combination of fixed-effects estimators and instrumental-variable estimators, such as the Arellano and Bond (1991)
generalized method-of-moments estimator. These methods require pooling individual
groups and allowing only the intercepts to differ across the groups. One of the central findings from the large N , large T literature, however, is that the assumption of
homogeneity of slope parameters is often inappropriate. This point has been made
by Pesaran and Smith (1995); Im, Pesaran, and Shin (2003); Pesaran, Shin, and Smith
(1997, 1999); and Phillips and Moon (2000).1
c 2007 StataCorp LP
st0125
198
With the increase in time observations inherent in large N , large T dynamic panels,
nonstationarity is also a concern. Recent papers by Pesaran, Shin, and Smith (1997,
1999) offer two important new techniques to estimate nonstationary dynamic panels in
which the parameters are heterogeneous across groups: the mean-group (MG) and pooled
mean-group (PMG) estimators. The MG estimator (see Pesaran and Smith 1995) relies
on estimating N time-series regressions and averaging the coefficients, whereas the PMG
estimator (see Pesaran, Shin, and Smith 1997, 1999) relies on a combination of pooling
and averaging of coefficients.
In recent empirical research, the MG and PMG estimators have been applied in a
variety of settings. Freeman (2000), for example, uses the estimators to evaluate statelevel alcohol consumption over 19611995. Martinez-Zarzoso and Bengochea-Morancho
(2004) use them in an estimation of an environmental Kuznets curve in a panel of
22 OECD nations over 19751998. Frank (2005) uses the MG and PMG estimators to
evaluate the long-term effect of income inequality on economic growth in a panel of U.S.
states over 19452001.
Assume an autoregressive distributive lag (ARDL) (p, q1 , . . . , qk ) dynamic panel specification of the form
yit =
p
X
ij yi,tj +
q
X
ij Xi,tj + i + it
(1)
j=0
j=1
p1
X
j=1
ij yi,t1 +
q1
X
ij Xi,tj + i + it
(2)
j=0
Pp
Pq
P
Pp
where i = (1 j=1 ij ), i = j=0 ij /(1 k ik ), ij = m=j+1 im
Pq
1, 2, . . . , p 1, and ij
= m=j+1 im j = 1, 2, . . . , q 1.
j=
199
be significantly negative under the prior assumption that the variables show a return
0
to a long-run equilibrium. Of particular importance is the vector i , which contains the
long-run relationships between the variables.
The recent literature on dynamic heterogeneous panel estimation in which both N
and T are large suggests several approaches to the estimation of (2). On one extreme,
a fixed-effects (FE) estimation approach could be used in which the time-series data for
each group are pooled and only the intercepts are allowed to differ across groups. If
the slope coefficients are in fact not identical, however, then the FE approach produces
inconsistent and potentially misleading results. On the other extreme, the model could
be fitted separately for each group, and a simple arithmetic average of the coefficients
could be calculated. This is the MG estimator proposed by Pesaran and Smith (1995).
With this estimator, the intercepts, slope coefficients, and error variances are all allowed
to differ across groups.
More recently, Pesaran, Shin, and Smith (1997, 1999) have proposed a PMG estimator that combines both pooling and averaging. This intermediate estimator allows the
intercept, short-run coefficients, and error variances to differ across the groups (as would
the MG estimator) but constrains the long-run coefficients to be equal across groups (as
would the FE estimator). Since (2) is nonlinear in the parameters, Pesaran, Shin, and
Smith (1999) develop a maximum likelihood method to estimate the parameters.
Expressing the likelihood as the product of each cross-sections likelihood and taking
the log yields
0
lT ( , , ) =
N
N
1X 1
T X
ln(2i2 )
{yi i i ()}0 Hi {yi i i ()}
2 i=1
2 i=1 i2
(3)
200
b2 X 0 Xi
i
i
bi2
b1 X 0 b1
b12
0 b
b
1 1
b12
bN X 0 bN
N
2
bN
..
.
0
..
.
b1 X 0 W1
b12
0
b
1 W1
b12
0 b
bN
N
2
bN
bN X 0 WN
N
2
bN
..
.
..
.
0
..
.
0
bN
WN
2
bN
..
.
0
..
.
W10 W1
b12
0
WN
WN
2
bN
The MG parameters are simply the unweighted means of the individual coefficients.
For example, the MG estimate of the error correction coefficient, , is
b = N 1
N
X
i=1
bi
(4)
bb=
X
1
b2
(bi )
N (N 1) i=1
(5)
The mean and variance of other short-run coefficients are similarly estimated.
3.1
Syntax
xtpmg varlist
if
in
3.2
Options
lr(varlist) specifies the variables to be included when calculating the long-run cointegrating vector.
ec(string) is used to specify the name of the newly created error-correction term; default
is ec.
replace overwrites the error-correction variable, if it exists.
constraints(string) specifies the constraints to be applied to the model. This option
is currently used only with option pmg.
201
noconstant suppresses the constant term. This option cannot be used with option dfe.
cluster(varname) specifies that the observations are independent across groups (clusters), but not necessarily within groups. varname specifies to which group each
observation belongs, e.g., cluster(personid) in data with repeated observations
on individuals. cluster() affects the estimated standard errors and variance
covariance matrix of the estimators (VCE), but not the estimated coefficients; see
[U] 20.14 Obtaining robust variance estimates.
level(#) sets the confidence level; default is level(95).
algorithm spec is
technique(algorithm
spec) specifies
the ml
maximization technique.
algorithm # algorithm #
. . . , where algorithm is nr | bfgs | dfp . The
bhhh algorithm is not compatible with xtpmg. technique() can be used only with
option pmg.
difficult will use a different stepping algorithm in nonconcave regions of the likelihood.
full specifies that all N cross-section regression results be listed. Only the averaged
coefficients are listed by default.
model is the type of estimator to be fitted and is one of the following:
pmg is the default and specifies the PMG estimator. This model constrains the longrun coefficient vector to be equal across panels while allowing for group-specific
short-run and adjustment coefficients.
mg specifies the MG estimator. This model fits parameters as averages of the N
individual group regressions.
dfe specifies the dynamic fixed-effects estimator.
4
4.1
We illustrate the use of xtpmg with annual aggregate consumption data for 24 Organisation for Economic Co-operation and Development (OECD) nations. These data are taken
from Pesaran, Shin, and Smith (1997, 1999) and encompass the years 19601993.3 The
1993 annual observation for Belgium is not included in the estimation sample, leaving
an estimation period of 19621992 for Belgium and 19621993 for the other 23 OECD
countries. xtpmg requires that the data be tsset before estimation.
. use jasa2
. tsset id year
panel variable:
time variable:
id (unbalanced)
year, 1960 to 1993
3. The original data and GAUSS code are available on Pesarans web site:
http://www.econ.cam.ac.uk/faculty/pesaran.
202
(6)
(7)
i
1i ,
it =
10i +11i
1i ,
and 2i =
(8)
20i +21i
1i .
The error-correction speed of adjustment parameter, i , and the long-run coefficients, 1i and 2i , are of primary interest. With the inclusion of 0i , a nonzero mean
of the cointegrating relationship is allowed. One would expect i to be negative if the
variables exhibit a return to long-run equilibrium. Most aggregate consumption theories
indicate that the long-run income elasticity, 1i , should be equal to one. The inflation
effect, 2i , is generally thought to be negative.
4.2
PMG estimation
The first example estimates the PMG estimator for model (8). In this context, the PMG
model allows for heterogeneous short-run dynamics and common long-run income and
inflation elasticities. Often only the long-run parameters are of interest. The default
results of the pmg option include the long-run parameter estimates and the averaged
short-run parameter estimates.4
4. The PMG standard errors match the GAUSS output provided by Pesaran, Shin, and Smith (1999).
The standard errors in table 1 of Pesaran, Shin, and Smith (1999) are, however, different from those
reported here and from the original GAUSS program.
203
Number of obs
Number of groups
Obs per group: min
avg
max
Log Likelihood
Coef.
Std. Err.
P>|z|
=
=
=
=
=
=
767
24
31
32.0
32
2327.075
ec
pi
y
-.4658438
.9044336
.0567332
.0086815
-8.21
104.18
0.000
0.000
-.5770388
.8874181
-.3546487
.9214491
ec
pi
D1.
y
D1.
_cons
-.1998761
.0321683
-6.21
0.000
-.2629247
-.1368275
-.0182588
.0277523
-0.66
0.511
-.0726522
.0361347
.3269355
.1544507
.0574236
.0216943
5.69
7.12
0.000
0.000
.2143873
.1119308
.4394838
.1969707
SR
[ec]y = 1
chi2( 1) =
Prob > chi2 =
121.18
0.0000
The corresponding 2 value of 121.2 leads to rejection of the null hypothesis of unit
income elasticity.
The full option estimates and saves an N + 1 multiple-equation model. The first
equation (labeled per option ec) presents the normalized cointegrating vector. 5 The
remaining N equations list the group-specific short-run coefficients.
5. The vector has been normalized such that the coefficient on the first term in the cointegrating
vector is 1. Accordingly, the normalized term is omitted from the estimation output.
204
Coef.
Number of obs
Number of groups
Obs per group: min
avg
max
Log Likelihood
Std. Err.
P>|z|
=
=
=
=
=
=
767
24
31
32.0
32
2327.075
ec
pi
y
-.4658438
.9044336
.0567332
.0086815
-8.21
104.18
0.000
0.000
-.5770388
.8874181
-.3546487
.9214491
ec
-.0378815
pi
D1.
-.2114431
y
D1.
.5195067
_cons
.0336383
(output omitted )
id_196
ec
-.4978606
pi
D1.
.0721044
y
D1.
.0390557
.2743539
_cons
.0240594
-1.57
0.115
-.0850371
.0092742
.0866913
-2.44
0.015
-.3813548
-.0415314
.055876
.0147912
9.30
2.27
0.000
0.023
.4099918
.0046481
.6290217
.0626285
.0887771
-5.61
0.000
-.6718605
-.3238608
.0721146
1.00
0.317
-.0692375
.2134464
.103316
.0630399
0.38
4.35
0.705
0.000
-.16344
.1507979
.2415515
.3979099
id_111
Since each group has its own estimated equation, we can, for example, predict variables intuitively.
. predict dc111 if id==111, eq(id_111)
(783 missing values generated)
4.3
205
MG estimation
The MG estimates are the unweighted mean of the N individual regression coefficients.
xtpmg with the mg option loops through all panels in the sample to estimate the parameters of (8).6
. xtpmg d.c d.pi d.y if year>=1962, lr(l.c pi y) ec(ec) replace mg
Mean Group Estimation: Error Correction Form
(Estimate results saved as mg)
Coef.
Std. Err.
P>|z|
ec
pi
y
-.3529095
.9181344
.1168025
.0272673
-3.02
33.67
0.003
0.000
-.5818381
.8646914
-.1239809
.9715774
ec
pi
D1.
y
D1.
_cons
-.3063473
.0301599
-10.16
0.000
-.3654597
-.2472349
-.0253642
.0294774
-0.86
0.390
-.0831389
.0324104
.2337588
.2082185
.0489502
.1089385
4.78
1.91
0.000
0.056
.1378182
-.005297
.3296994
.4217339
SR
The MG estimates are presented as a two-equation model: the normalized cointegrating vector and the short-run dynamic coefficients. In comparing the PMG and MG
estimators, we note that the estimated long-run income and inflation elasticities are
statistically significant and properly signed in both models. However, the PMG estimate of the inflation elasticity is larger in magnitude than the estimate from the MG
model (.47 and .35, respectively). The opposite is true for the estimated long-run
income elasticity (.90 and .92, respectively). The speed of adjustment estimates from
each model imply significantly different short-run dynamics (compare b = .20 from
b = .31 from MG).
PMG and
Recall that the PMG estimator constrains the long-run elasticities to be equal across
all panels. This pooling across countries yields efficient and consistent estimates when
the restrictions are true. Often, however, the hypothesis of slope homogeneity is rejected
empirically. If the true model is heterogeneous, the PMG estimates are inconsistent; the
MG estimates are consistent in either case. The test of difference in these models is
performed with the familiar Hausman test.7
6. Actually, since (8) is nonlinear in the parameters, xtpmg estimates the reduced-form regressions for
each group, ct = ct1 + 1 yt + 2 t + 1 yt + 2 t , and then applies Statas nlcom command to
recover the underlying parameter estimates.
7. Statas hausman test offers a sigmamore option. This option forces the variancecovariance matrix
from the efficient model (PMG here) to be used in calculating the test statistic. This is what is presented
here. See Baum, Schaffer, and Stillman (2003) for more details.
206
Test:
-.3529095
.9181344
(b-B)
Difference
-.4658438
.9044336
.1129342
.0137008
sqrt(diag(V_b-V_B))
S.E.
.126218
.0311167
The calculated Hausman statistic is 1.06 and is distributed 2 (2). Here we conclude
that the PMG estimator, the efficient estimator under the null hypothesis, is preferred.
4.4
Dynamic FE
The dynamic FE estimator, like the PMG estimator, restricts the coefficients of the cointegrating vector to be equal across all panels. The FE model further restricts the speed
of adjustment coefficient and the short-run coefficients to be equal. xtpmg with the dfe
option fits the model in (8) while allowing panel-specific intercepts.8 An allowance for
intragroup correlation in the calculation of standard errors is made with the cluster()
option.
. xtpmg d.c d.pi d.y if year>=1962, lr(l.c pi y) ec(ec) replace dfe cluster(id)
Standard errors adjusted with cluster(id) option.
Dynamic Fixed Effects Regression: Estimated Error Correction Form
(Estimate results saved as DFE)
Coef.
Std. Err.
P>|z|
ec
pi
y
-.266343
.9120574
.102506
.0468008
-2.60
19.49
0.009
0.000
-.4672509
.8203295
-.065435
1.003785
ec
pi
D1.
y
D1.
_cons
-.1794146
.0434584
-4.13
0.000
-.2645915
-.0942378
-.0280826
.0325622
-0.86
0.388
-.0919034
.0357382
.3811944
.1257634
.070876
.0805454
5.38
1.56
0.000
0.118
.24228
-.0321025
.5201089
.2836294
SR
8. For the FE model, xtpmg is simply a wrapper for Statas xtreg, fe command (see [XT] xtreg).
The underlying parameters of (8) are calculated with nlcom and stored as EC. The reduced-form model,
as estimated by xtreg, fe, is stored as DFE.
207
All coefficients from the dynamic FE model are properly signed and, in fact, similar
to the PMG and MG estimates. As discussed in Baltagi, Griffin, and Xiong (2000),
FE models are subject to a simultaneous equation bias from the endogeneity between
the error term and the lagged dependent variable. The Hausman test can be easily
performed to measure the extent of this endogeneity.
. hausman mg DFE, sigmamore
Coefficients
(b)
(B)
mg
DFE
pi
y
Test:
-.3529095
.9181344
-.266343
.9120574
(b-B)
Difference
sqrt(diag(V_b-V_B))
S.E.
-.0865666
.0060771
25.80672
6.024402
Results indicate that the simultaneous equation bias is minimal for these data and, for
this example, we conclude that the FE model is preferred over the MG model.
Conclusion
This paper follows recent advances offered by Pesaran and Smith (1995) and Pesaran,
Shin, and Smith (1997; 1999) in the estimation of nonstationary heterogeneous panels
with large N and large T . We offer a new Stata command, xtpmg, that estimates
three alternative models: a traditional dynamic FE estimator that relies on pooling of
cross-sections, an MG estimator that relies on averaging of cross-sections, and a PMG
estimator that relies on a combination of pooling and averaging of coefficients.
References
Arellano, M., and S. Bond. 1991. Some tests of specification for panel data: Monte
Carlo evidence and an application to employment equations. Review of Economic
Studies 58: 277297.
Baltagi, B. H. 2001. Econometric Analysis of Panel Data. 2nd ed. New York: Wiley.
Baltagi, B. H., J. M. Griffin, and W. Xiong. 2000. To pool or not to pool: Homogeneous
versus heterogeneous estimators applied to cigarette demand. Review of Economics
and Statistics 82: 117126.
Baum, C. F., M. E. Schaffer, and S. Stillman. 2003. Instrumental variables and GMM:
Estimation and testing. Stata Journal 3: 131.
208
Frank, M. W. 2005. Income inequality and economic growth in the U.S.: A panel
cointegration approach. Sam Houston State University Working Paper 05-03.
Freeman, D. G. 2000. Alternative panel estimates of alcohol demand, taxation, and the
business cycle. Southern Economic Journal 67: 325344.
Im, K. S., M. H. Pesaran, and Y. Shin. 2003. Testing for unit roots in heterogeneous
panels. Journal of Econometrics 115: 5374.
Martinez-Zarzoso, I., and A. Bengochea-Morancho. 2004. Pooled mean group estimation
of an environmental kuznets curve for CO2 . Economics Letters 82: 121126.
Pesaran, M. H., Y. Shin, and R. P. Smith. 1997. Estimating long-run relationships in
dynamic heterogeneous panels. DAE Working Papers Amalgamated Series 9721.
. 1999. Pooled mean group estimation of dynamic heterogeneous panels. Journal
of the American Statistical Association 94: 621634.
Pesaran, M. H., and R. P. Smith. 1995. Estimating long-run relationships from dynamic
heterogeneous panels. Journal of Econometrics 68: 79113.
Phillips, P. C. B., and H. R. Moon. 2000. Nonstationary panel data analysis: An
overview of some recent developments. Econometric Reviews 19: 263286.
About the authors
Edward F. Blackburne III is an associate professor in the Department of Economics and International Business, Sam Houston State University. His current research interests are benchmarking
racial profiling and modeling asymmetric time series.
Mark W. Frank is an associate professor in the Department of Economics and International
Business, Sam Houston State University. His current research interests are measuring effects
of income inequality on economic growth and the economics of alcohol advertising.