Measuring The Output Gap Using Large Datasets: Matteo Barigozzi Matteo Luciani
Measuring The Output Gap Using Large Datasets: Matteo Barigozzi Matteo Luciani
Abstract
We decompose US aggregate output into potential output and the output gap by
means of a non-stationary dynamic factor model estimated on a large dataset of
macroeconomic indicators, combined with a non-parametric trend-cycle decomposi-
tion of the factors. We find that: (1) from the mid-90s to 2008 the US economy
operated above its potential; (2) as of 2018:Q4 the labor market was tighter than the
goods and services market; and (3) our output gap measure revises modestly in real-
time. Due to its purely data driven nature, our measure is a natural complementary
tool to the theoretical models used in policy institutions.
∗
We thank for helpful comment: Stephanie Aaronson, Gianni Amisano, Andrew Figura, Manuel
Gonzales-Astudillo, James Mitchell, Filippo Pellegrino, John Roberts, and Andrea Stella.
Disclaimer: the views expressed in this paper are those of the authors and do not necessarily reflect the
views and policies of the Board of Governors or the Federal Reserve System.
-2
-4
CBO
-6 BL
-8
-10
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
The blue line is the the output gap estimated by the CBO, while the red line is our estimate.
The shaded areas around our estimate are 68% and 84% confidence bands, respectively.
unemployment rate gap signals that the economy is operating above its potential, while our
estimate of the output gap suggests that the US economy was operating at its potential.
In other words, our model suggests that the labor market is tighter than the goods and
services market at the end of 2018.
Finally, we investigate the real-time properties of our model by conducting a quasi-
real-time evaluation. We find that our output gap measure revises modestly, and none of
the results presented in the previous paragraphs would have been overturned in real time.
In the last 30 years, many papers have suggested different ways to obtain a trend-cycle
decomposition of GDP, both based on univariate methods (e.g., Watson, 1986; Morley et al.,
2003; Kamber et al., 2018; Hamilton, 2018; Phillips and Shi, 2019), or on multivariate
techniques (e.g., Fleischman and Roberts, 2011; Aastveit and Trovik, 2014; Morley and
Wong, 2017; Jarociński and Lenza, 2018; Hasenzagl et al., 2018). In the latter group of
papers, two sets of them are particularly related to our approach.
The first set of papers includes those using non-stationary low-dimensional unobserved
component models to estimate the output gap (e.g., Fleischman and Roberts, 2011; Jaro-
ciński and Lenza, 2018; Hasenzagl et al., 2018). Compared with these papers, which typi-
cally consider estimates of the output gap using only highly aggregated variables such as
GDP, the unemployment rate, and price inflation, we estimate our model on a large dataset
comprising 103 variables, thus capturing information coming from a wider spectrum of the
where Dit is driven by a deterministic linear trend, and xit is the stochastic component
such that E[xit ] = 0. Letting xt = (x1t · · · xnt )0 , we assume xt ∼ I(1)—i.e., at least one of
its components has a unit root. In a high-dimensional setting, it is reasonable to assume
that fluctuations are the results of not only common trends and common cycles but also
idiosyncratic dynamics. Thus,
where Tit ∼ I(1) is the component driven by the common trends, Cit ∼ I(0) is the com-
ponent driven by the common cycles, and ξit is the idiosyncratic component, which is
allowed to be either I(1) (in presence of idiosyncratic trends) or I(0) (e.g., measurement
errors). The components Tit and Cit are driven by factors common across all series, and
thus constitute the common component defined as χit = Tit + Cit . Hence, (2) is also written
as
yit = Dit + χit + ξit . (3)
The goal of this paper is to estimate (2). To do so, we first disentangle common
and idiosyncratic dynamics by fitting a non-stationary dynamic factor model on a large
dataset of US macroeconomic indicators. Then, we decompose the common component
χit into common trends and common cycles by applying a non-parametric trend-cycle
decomposition to the estimated latent common factors.
where ft = (f1t · · · fqt )0 are the q common latent factors capturing co-movements across
series and across time, λi` = (λi1` · · · λiq` )0 are the q factor loadings of series i at lag `,
s ≥ 0 and p ≥ 1 are finite integers, and ai and bi are finite real numbers.
In what follows, we summarize the main features of model (4)-(7), while we refer the
interested reader to Barigozzi and Luciani (2019b) for a formal treatment of the model.
2.1.1 Assumptions
2.1.2 Estimation
The model specified in (4)-(7) has a state-space form, and we estimate it by Quasi-
Maximum Likelihood, implemented through the Expectation Maximization (EM) algo-
rithm, where in the E-step the factors ft are estimated with the Kalman Smoother. This
ft = Ψτt + γt , (8)
where τt = (τ1t . . . τ(q−d)t )0 is the vector of (q − d) common trends, such that τjt ∼ I(1) for
all j and Ψ is q × (q − d), while γt is a q-dimensional stationary vector—see e.g., Escribano
and Peña (1994).
We characterize the common trends and cycles by the following assumption.
Assumption 7. There exists a q × d matrix A such that rk(A) = d and the common
cycles are given by ct = A0 γt . There exist a q × (q − d) matrix B such that rk(B) = (q − d)
and the common trends are given by τt = B0 ft .
Assumption 7 implies that the common cycles vector has the same dimension as the di-
mension of the stationary component of the factor vector, i.e., the number of cointegration
relations. This is a common assumption in many trend-cycle decompositions (e.g., Kasa,
1992; Gonzalo and Granger, 1995) and it is in line with the idea of considering cycles as
deviations from long-run equilibria (see the definition of output gap given by CBO and
discussed in Section 6). The requirement that common trends are linear combinations of
the data is also standard (e.g., Gonzalo and Granger, 1995, Proposition 2).
Moreover, from (8) and Assumption 7, we have
10
and therefore, letting λτi 0 (L) = λi 0 (L)Ψ and λci 0 (L) = λi 0 (L)Ψ⊥ (Ψ0⊥ Ψ⊥ )−1 , and by com-
bining (1), (4) and (10), we have the trend-cycle decomposition of the data:
yit = Dit + λτi 0 (L)τt + λci 0 (L)ct + ξit = Dit + Tit + Cit + ξit . (11)
This decomposition does not impose any parametric model for either the common trends
(e.g., random walk) or the common cycles (e.g., AR(2)). Furthermore, this decomposition
does not assume any specific relation between the permanent and transitory shocks. In
other words, there can be macroeconomic shocks that affect both the common trends and
the common cycles.
To estimate (10) and (11) we need to face two problems: (i) the factors ft are not
observed, and (ii) the loadings Ψ are unknown. In practice, we do have a consistent
estimator of the factors from Section 2.1.2, and, based on the factor representation of
the common factors (8), we can estimate the loadings by means of PC analysis. More
b = T −2 PT fbt fb0 and denote
precisely, for given consistent estimates of the factors fbt , let S t=1 t
b b
by (Ψ Ψ⊥ ) the q × q matrix with columns given by the normalized eigenvectors of S, b
ordered according to the decreasing value of the corresponding eigenvalues, and such that
b is q ×(q −d), Ψ
Ψ b ⊥ is q ×d. Then, due to orthonormality of the eigenvectors, the estimated
common trends and common cycles are given by the projections τbt = Ψ b 0 fbt ,
b 0 fbt , and cbt = Ψ
⊥
respectively. As shown in Peña and Poncela (2006) and Zhang et al. (2018), respectively,
these projections recover consistently the space spanned by the common trends and the
cointegration vectors, which under Assumption 7 coincides with the space spanned by the
common cycles.
11
12
Finally, we chose the number of lags p in the VAR for the common factors as the
minimum number of lags such that the residuals of the VAR exhibit no autocorrelation.
13
Time-varying parameter 2: The mean of the unemployment rate drifts gradually over
time.
The idea here is that several idiosyncratic labor market and demographic characteristics
might generate an idiosyncratic trend in the unemployment rate (see Walsh, 1998; Daly
et al., 2011, among others). For example, labor supply factors such as the entry and
exit from the labor market of the baby boom generation, the increased participation rate
among women, and changes in youth labor force share. Or labor demand factors such as, for
example, possible skill or location mismatch between job openings and the characteristics of
the unemployed. Finally, the emergency and extended benefits, which are a standard policy
response to elevated cyclical unemployment that expire once the labor market recovers,
might have transitory effects on the natural rate of unemployment.5
To capture this idiosyncratic trend, we introduce and additional latent state for the
unemployment rate, which evolves over time according to the following local level model:
i.i.d.
DUR,t = DUR,t−1 + t , t ∼ N (0, σ2 ).
Notice that this modeling strategy is consistent with the unit root test performed in the
previous section.
Constraint: In the long run GDP and GDI are driven exclusively, and by the same
amount, by factors common to the whole economy and by the secular trend, while the
5
In the US standard unemployment benefits lasts for 26 weeks. After 26 weeks, there are two programs
that allow to extend further the unemployment benefits (the Emergency Unemployment Compensation and
Extended Benefits), which extend unemployment benefits eligibility up to 99 weeks. These two measures
were introduced during the great recession, the previous maximum eligibility was 65 weeks.
14
15
2.0 3.0
2.0
1.5
2.0
1.0 1.0
1.0
0.5
The left plot reports the spectral densities of the common trend ∆b τt (blue line), and the common cycle ∆b ct (orange line)
estimated when q = 2. The middle plot reports the spectral densities of the common trend (blue line), and of the two common
cycles (orange and yellow lines) estimated when q = 3. The right plot reports the spectral densities of the common trend
(blue line), and of the three common cycles (orange, yellow, and purple lines) estimated when q = 4. On the horizontal axis
of the plots in the left column we report periods τ measured in years such that the corresponding frequencies are given by
θ = 2π
4τ
.
2.5 1.4
1.5 1.2
2.0
1.0
0.6
1.0
0.5 0.4
0.5
0.2
On the horizontal axis of the plots in the left column we report periods τ measured in years such that the corresponding
frequencies are given by θ = 2π
4τ
.
16
Why do we need a third cycle to explain the unemployment rate? A possible expla-
nation is that as documented by Barnichon (2010), among others, prior to the mid 1980s,
productivity was procyclical; afterwards, it was acyclical or countercyclical. As a result,
Note that Table 2 shows the variance of the level of the estimated idiosyncratic component of log GDP.
7
17
2 2 2 2
1 1 1 1
0 0 0 0
Levels
-1 -1 -1 -1
-2 τt -2 -2 -2
-4 -4 -4 -4
-5 -5 -5 -5
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
4 4 4
c2t 4
First differences
2 2 2 2
0 0 0 0
-2 -2 -2 -2
-4 -4 -4 -4
τt c1t c3t
-6 -6 -6 -6
-8 -8 -8 -8
1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015
This figure reports the estimated common trend and the estimated common cycles when q = 4. The upper row shows levels
of the estimated quantities, while the lower row shows first differences.
In the complementary appendix we show that the model with three common factors fails to provide a
8
18
In other words, potential output in our framework is the sum of the local linear trend,
which captures the secular decline in long-run US output growth, and the part of GDO
which is explained by the “real” common trend. Similarly, we define the output gap as the
cyclical component of GDO:
s
X
dt = b0
λ b bt−` (13)
OG GDO,` Φ⊥ c
`=0
that is, the output gap is that part of GDO that is accounted for by the common cycles.
In what follows, we compare our estimates with the one published by the Congressional
Budget Office (CBO). The CBO estimates potential output and the output gap by using
the so-called “production function approach” (Kiley, 2013) according to which potential
output is that level of output consistent with current technologies and normal utilization
of capital and labor, and the output gap is the deviation of output from potential output.
To be more precise, the CBO model is based upon a textbook Solow growth model, with
a neoclassical production function, and estimates trends in the components of GDP based
on a variant a variant of the Okun’s law. In other words, in the CBO framework, actual
output is above its potential (the output gap is positive), when the unemployment rate is
below the natural rate of unemployment, which is in turn defined as the non-accelerating
inflation rate of unemployment (NAIRU), i.e., that level of unemployment consistent with
a stable inflation—for further details see Congressional Budget Office (2001). Notice that
also for the CBO the output gap is assumed to revert to zero in the long run as it imposes
in its forecast that in 10 years the output gap will be zero—see, e.g., Congressional Budget
Office (2004).
Figure 5 shows our measure of potential output (red line), where the shaded areas
around our estimate are 68% and 84% confidence bands, respectively, together with the
estimate produced by the CBO (blue line), and the log-level of GDP (black line). As we
can see, the two trend estimates start to diverge at the end of the ’90s, with the CBO
19
6 GDP
960
CBO
BL
940 4
GDP
CBO
BL
920 2
900
0
880
-2
860
-4
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015
The black line is log GDP, the blue line is the estimate of potential output computed by the CBO, while the red line is our
estimate. The shaded areas around our estimate are 68% and 84% confidence bands, respectively.
estimating higher potential growth (i.e., a steeper increase in the trend) than our model
(4.2% vs. 3.7% in 2000:Q1). However, at the beginning of the 2000s, both trend estimates
slowdown, and they reunite at the end of the sample.
Our results contrast with those of Coibion et al. (2018), who estimated that potential
output fell during the financial crisis, but then since 2010 has started to grow at approxi-
mately the same pace as before the crisis. However, as pointed out by Ng (2018) the path
of potential output estimated by Coibion et al. (2018) is heavily influenced by their choice
to include a deterministic linear trend in the model for GDP, which is responsible for the
large output gap they estimate at the end of their sample in 2017:Q4 (see also Bullard,
2012).
4 6
CBO
BL
2 4
0 2
-2 0
-4 -2
CBO
-6 BL -4
-8 -6
-10 -8
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015
The blue line in the left is the level of the output gap estimated by the CBO, while the red line is our estimate. The blue line
in the right is the four-quarter percent change of the output gap estimated by the CBO, while the red line is our estimate.
The shaded areas around our estimate are 68% and 84% confidence bands, respectively.
20
21
The results in Figure 7, which shows estimates obtained with our model when we
exclude all price indicators from our dataset, confirm this interpretation. Two main results
are worth discussing: first, if we exclude all price indexes our estimate of the output gap is
even larger between 2002 and 2008, which shows how our model takes signal from inflation,
but not as much as the CBO. Second, even excluding all price indexes our estimate in the
last 10 years is mainly unchanged; the only difference here is in 2009 when the output gap
grows at a faster pace if we include all prices in the dataset. This make sense as in 2009
the 12-month percent change in core PCE prices rebounded from less than 1% to about
1.7%.
Figure 7: Output gap
No price inflation indexes
Levels Four-quarter percent change
6 8
4 6
BL
Excl. prices
2 4
0 2
-2 0
-4 -2
-6 -4
BL
-8 Excl. prices -6
-10 -8
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015
In each plot the grey line is the estimate obtained when the model is estimated on a dataset excluding all price
indexes (ID 18, 19, 36–52). The red line is our benchmark estimate and the red shaded area are the 64% and 84%
confidence bands.
So far, we have proceeded as if the output gap is the sole indicator of the cyclical
position of the US economy. However, another widely used indicator of the cyclical position
of the economy is the unemployment rate gap, and indeed in the framework adopted by the
CBO, the output gap and the unemployment rate gap are linked via the Okun’s law. Using
our framework, we can construct an estimate of the unemployment rate gap as the cyclical
component of the unemployment rate, i.e., as that part of the unemployment rate driven by
22
5
4
CBO CBO
4 BL BL
3
3
2
2
1 1
0
0
-1
-1
-2
-2
-3
-4 -3
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1975 1980 1985 1990 1995 2000 2005 2010 2015
The blue line is the unemployment rate gap estimated by the CBO, while the red line is our estimate. In both plots the
shaded areas around our estimate are 68% and 84% confidence bands, respectively.
Where does the 2018 decoupling of the output gap and the unemployment rate gap
come from? To answer this question, first we have to point out that in the CBO model
the relationship between the output gap and the unemployment rate gap is constrained by
the Okun’s law. In our model, instead, the output gap and the unemployment rate gap
are driven by a three-dimensional cycle process cbt , and there are no restrictions on the
impact of cbt on these measures. That said, as we can see from Figure 9, on average our
estimated output gap and unemployment rate gap are related approximately in the same
way as those estimated by the CBO; however, this relationship holds less tightly in our
model, and because of this it is possible to have a decoupling of the unemployment rate
23
Unemployment gap
2 2
1 1
0 0
-1 -1
-2 -2
-3 -3
-8 -6 -4 -2 0 2 4 6 -8 -6 -4 -2 0 2 4 6
Output gap Output gap
This figure shows the Okun’s law relationship with the output gap on the horizontal axis and the unemployment gap on
the vertical axis for the time period from 1962Q4 to 2018:Q4. In the left (right) plot the red (blue) dots correspond to
an output gap - unemployment gap pair at time t estimated by us (CBO). In both plots, the green line is the best fit
line. β̂ is the estimated coefficient of the linear regression of the output gap on the unemployment gap, and R2 is relative
goodness-of-fit coefficient.
24
3 3
2
2
1
0
1
-1
-2 0
-3
-1
-4
-5 -2
-6
-3
-7
1985 1990 1995 2000 2005 2010 2015 1980 1985 1990 1995 2000 2005 2010 2015
In each plot the blue line is the estimate of the output gap obtained on the sample ending in 2000:Q1, the red line is the
estimate obtained on the benchmark sample ending in 2018:Q4, and the yellow lines are the estimate obtained on expanding
window from 2000:Q1 to 2018:Q4. Finally, the black dots represent the estimate of the output gap for quarter Q and year
Y obtained on the sample ending at quarter Q and year Y.
Our output gap measure (left plot of Figure 10) revises very little when considering
samples no more than five years shorter than our benchmark sample, while the size of
the revision becomes relevant only when considering samples that are ten or more years
shorter than our benchmark sample (see Table 4). However, these revisions are far from
being dramatic—the only exception to this statement being the early 1990s when the model
was pointing to a much tighter economy in quasi-real-time. In particular, our quasi -real-
time estimates still indicate a constant overheating of the economy from the mid-90s to
25
Similar comments apply to the real-time properties of our unemployment gap estimate
(right plot of Figure 10). However, differently from our output gap estimates, the quasi-
real-time estimate of the unemployment gap points out to a level of slack at the beginning
of the 1990s, very close to the one estimated on the full sample.
Summing up, in general, the conclusions reached in Section 6 are still valid.
data transformations, and by setting the shrinkage parameter to 0.04 (the value that Morley and Wong
(2017) find optimal for their “Large” model), and by including four lags in the VAR.
26
4 4 4 4
2 2 2 2
0 0 0 0
OGt
-2 -2 -2 -2
BL
MW
-4 -4 -4 -4
BL BL
HP H
-6 -6 -6 -6
BL
-8 -8 KMW -8 -8
1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
8 8 8
10
6 6 6
BL BL BL BL
MW KMW HP H
4 4 4
5
2 2 2
∆4 OGt
0 0 0
0
-2 -2 -2
-4 -4 -4
-5
-6 -6 -6
-8 -8 -8
-10
1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1965 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
This figure compares the estimate of the output gap obtained with our model, and those obtained with a multivariate BN
decomposition derived from a Large Bayesian VAR estimated on all the 103 variables in our dataset as in Morley and Wong
(2017) (first column), the “BN filter” developed by Kamber et al. (2018) (second column), the HP filter (third column), and
the Hamilton filter (fourth column). The plots on the top row show estimates of the level of the output gap, while the plots
on the lower row show four-quarter percent change.
11, the multivariate BN decomposition of Morley and Wong (2017) yields an output gap
estimate often very similar to our estimate. In particular, similar to our model, it points to
a large output gap before the great financial crisis. However, in contrast with our estimate,
it points to a much tighter economy at the end of the sample. This latter result might be
likely linked to the fact that, as explained by Morley and Wong (2017), the unemployment
rate is the most important source of information for their model, and, as we have discussed
in Section 6, according to our model the labor market is pointing to a tighter economy than
the signal coming from the goods and services market. Moreover, as we can see from the
lower plot of the first column of Figure 11, the multivariate BN is slightly lagging compared
with our measure in particular during recessions. This might be the consequence of the
fact that Morley and Wong (2017) are using a VAR, which by nature uses only past values
to construct the output gap measure, whereas the factor model also uses contemporaneous
values.
Moving to the univariate models, as we can see from Figure 11, although the peaks
and the troughs of the output gap estimated by our measure are broadly consistent with
those estimated with univariate benchmarks, some significant differences emerge. First,
the output gap estimated with the BN filter exhibits very mild deviations from potential
output, to the point that the four-quarter percentage change of this measure decreased
only 2% during the financial crisis, thus implying that permanent factors were the main
drivers of the great recession.
27
9 Conclusions
In this paper, we measure potential output and the output gap by fitting a non-stationary
dynamic factor model on a large dataset of US macroeconomic indicators via the EM algo-
rithm, and by disentangling common trends from common cycles via the eigenanalysis of
the long-run covariance matrix of the latent common factors. Our methodology allows us to
capture the signal coming from a large information set, to account for stochastic and deter-
ministic trends, to model long-run output growth as a slow-moving process, and to model
the impact of common factors on the different variables in a dynamically heterogeneous
way.
We compare our estimate of the output gap with the one produced by the Congressional
Budget Office (CBO), and we find that: first, the dating of the turning points by the two
measures perfectly coincides. Second, since the late ’90s, the two measures diverge: the
CBO estimates that the output gap closed just a couple of years before the great financial
crisis, whereas our model suggests a persistent overheating of the economy from the mid-
90s to 2008. Third, at the end of the sample in 2018:Q4 our estimate of the output gap
suggests that the US economy was operating at its potential, whereas our estimate of
the unemployment rate gap signals that the economy was operating above its potential.
Finally, by means of a quasi-real-time exercise, we show that our output gap measure
revises modestly. This last result support the intuition that, by pooling a large number of
variables, it is possible to get a robust estimate of the signal in the data while parsing out
the noise.
In conclusion, our model proved capable of extracting signals from different dimensions
28
References
Aastveit, K. A. and Trovik, T. (2014). Estimating the output gap in real time: A factor
model approach. The Quarterly Review of Economics and Finance, 54:180–193.
Antolin-Diaz, J., Drechsel, T., and Petrella, I. (2017). Tracking the slowdown in long-run
GDP growth. The Review of Economics and Statistics, 99:343–356.
Aruoba, S. B., Diebold, F. X., Nalewaik, J., Schorfheide, F., and Song, D. (2016). Improv-
ing GDP measurement: A measurement-error perspective. Journal of Econometrics,
191:384–397.
Bai, J. and Li, K. (2016). Maximum likelihood estimation and inference for approximate
factor models of high dimension. Review of Economics and Statistics, 98:298–309.
Bai, J. and Ng, S. (2002). Determining the number of factors in approximate factor models.
Econometrica, 70:191–221.
Bai, J. and Ng, S. (2004). A PANIC attack on unit roots and cointegration. Econometrica,
72:1127–1177.
Bańbura, M. and Modugno, M. (2014). Maximum likelihood estimation of factor models
on datasets with arbitrary pattern of missing data. Journal of Applied Econometrics,
29:133–160.
Barigozzi, M., Lippi, M., and Luciani, M. (2016). Dynamic factor models, cointegration,
and error correction mechanisms. FEDS 2016-18, Board of Governors of the Federal
Reserve System.
Barigozzi, M., Lippi, M., and Luciani, M. (2019). Large-dimensional dynamic fac-
tor models: Estimation of impulse-response functions with i(1) cointegrated factors.
arXiv:1602.02398.
Barigozzi, M. and Luciani, M. (2019a). Quasi maximum likelihood estimation and inference
29
30
31
32
33
List of Abbreviations
Source:
BLS = US Department of Labor: Bureau of Labor Statistics
BEA = US Department of Commerce: Bureau of Economic Analysis
ISM = Institute for Supply Management
CB = US Department of Commerce: Census Bureau
FRB = Board of Governors of the Federal Reserve System
EIA = Energy Information Administration
WSJ = Wall Street Journal
CBO = Congressional Budget Office
FRBPHIL = Federal Reserve Bank of Philadelphia
34
35
36
37
A Real-Time
In this section, we present the fully real-time exercise. We retrieved real-time data vintages for
our 103 variables dataset starting in August 30 2013, that is after the 2013 NIPA comprehensive
data revision, and ending on March 29 2019, i.e., the vintage of data that we use to produce the
results in the paper.a Overall, we estimate the output gap on a total of 52 vintages, where each
vintage corresponds approximately to the data available the first Friday after GDI is released.
Details on the construction of the real-time data vintages are in the next section.
Figure A1 shows real-time estimates of the output gap for both our measure and the CBO
measure.b By looking at the figure, two main conclusions can be drawn: first, our estimate of
the output gap revises very little—the average size of the revision for our estimate is 25 basis
points, with a standard deviation of 35 basis points—and less than the CBO estimate. And,
second, by comparing the estimate obtained on the first vintage of data (the blue line) with the
one obtained on the last vintage (red line), it is interesting to notice that both our model and
the CBO model revised in the same direction, i.e., both models interpreted incoming data as
signaling that there was less slack in the economy.
Figure A2 shows the real-time and quasi-real-time estimates (presented in the paper) of the
output gap together in the same graph. As we can see, the real-time estimates and the quasi-
real-time estimates are very similar—the average size of the revision computed on comparable
samples is 18 basis points in the quasi-real-time exercise and 25 basis points in the real-time
exercise, with standard deviations of 26 and 35 basis points, respectively. Therefore, since the
difference between the real-time estimates and the quasi-real-time estimates are solely the results
a
Each year the BEA at the end of July revises NIPA data up to five years back in time, the so-called “annual
revision”. Every five years, the BEA produces a broader revisions of NIPA data, the so-called “comprehensive
revision”, in which it can revise the whole history of every national account variable.
b
The CBO publishes estimates of potential output two times a year (January and August). The first CBO
release that we have in our real-time database is from January 2014, while the last one is January 2019. We were
able to use all the CBO releases from January 2014 to January 2019, but the one from August 2018, for a total of
10 vintages. We cannot use the CBO estimate from August 2018, because the CBO started publishing potential
output in Chained 2012$ only in January 2019, meaning that potential output published in August 2018 was
expressed in Chained 2009$, while GDP was expressed in Chained 2012$—the BEA started publishing National
Account Statistics in Chained 2012$ following the 2018 NIPA comprehensive data revision—thus making the
computation of the output gap very complicated.
0 0
-1 -1
-2 -2
-3 -3
-4 -4
-5 -5
-6 -6
-7 -7
2011 2012 2013 2014 2015 2016 2017 2018 2011 2012 2013 2014 2015 2016 2017 2018
In each plot the blue line is the estimate of the output gap obtained with the vintage of data from August 30 2013, the
red line is the estimate obtained with the vintage of data from on March 29 2019„ and the yellow lines are the estimate
obtained with all the remaining 31 vintages. Finally, the 19 black dots represent the estimate of the output gap for quarter
Q and year Y obtained with the vintage of data ending at quarter Q and year Y corresponding to the first release of GDI
for quarter Q and year Y. Note that, for CBO the blue line represents the estimate available as of April 11 2014.
of data revisions, we can confirm the results in Orphanides and van Norden (2002) according
to which output gap revisions are mainly caused by parameter estimation, rather than data
revision.
Figure A2: Output gap
Real-time and quasi-real-time estimation
Levels Four-quarter percent change
0 3
2.5
-1
Fin
2 QRT
RT
-2
Fin 1.5
QRT
RT
-3 1
0.5
-4
-5
-0.5
-6 -1
2010 2011 2012 2013 2014 2015 2016 2017 2018 2010 2011 2012 2013 2014 2015 2016 2017 2018
In each plot the red line is the estimate of the output gap obtained on our benchmark sample (the “final” estimate),
the green line is the quasi-real-time estimate, and the yellow line is the real-time estimate, i.e., the green line is the line
connecting the black dots in Figure A1.
Page II of VII
For the first four vintages we were able to retrieve only 101 variables out of 103, as we were
not able to find vintages for two PPIs (variables 47 and 49).
Potential output and the output gap were estimated on about 9 data vintages per year
corresponding to the dates after the release of GDI. In a normal year, the first estimate of Q1
GDI is released at the end of May, and then is revised at the end of June and at the end of July;
The first estimate of Q2 GDI is released at the end of August, and then is revised September;
The first estimate of Q3 GDI is released at the end of November, and then is revised at the end
of December and at the end of January of the following year; The first estimate of Q4 GDI is
released at the end of March of the next year.
The dates in which GDI is released varies from year to year, therefore each vintage corre-
sponds to the data that were available the closest Friday to the day in which “Personal Income
and Outlays” are released. Of course, we considered only “Personal Income and Outlays” after
GDI is released.
B Forecasting
As we discussed in the Introduction of the paper, one of the reasons why economists are very
much interested in the estimation of the output gap is that, potentially, the output gap is a
gauge of future inflation. Moreover, because the economy cannot grow forever above/below its
potential growth rate, it is reasonable to expect that the output gap is also a potential gauge
of future growth. Therefore, to further test the reliability of our model and our output gap
estimate, in this Section, we test its out-of-sample behavior. To be more precise, we answer the
following questions: is our output gap estimate better than the one computed by the CBO in
forecasting GDP growth? Is it better in forecasting inflation?
In order to test the forecasting properties of our output gap measure, we run a quasi-real-
time forecasting exercise. For GDP, we forecast the annualized GDP growth rate between time
t and t + h ( h4 (GDPt+h − GDPt )) by using the following forecasting equation:
4
(GDPt+h − GDPt ) = α + β × OGt + t (I)
h
where OGt is the level of the output gap at time t. For inflation, we forecast the annualized
inflation rate between time t and t + h ( h4 (Pt+h − Pt ))by using the following Phillips curve type
forecasting equation:
X 5 X 5
4
(Pt+h − Pt ) = α + βi × OGt−i + γj πt−j + t (II)
h
i=0 j=0
where πt = Pt − Pt−1 (Kamber et al., 2018, among others, also used a similar specification of
the Phillips curve to forecast inflation).
The exercise works as follows: starting from 2000:Q1, (I) we estimate our model thus ob-
taining an estimate of OGt , and then (II) we produce forecasts of output (GDP) growth and
inflation up to h steps ahead by using (I) and (II);c then, we increase the sample size of one
c
Note that for the CBO estimate we use the final estimate, which means that we are using the final CBO
quarter and repeat the procedure up to the end of our sample for a total o (76 − h) forecasts.
Table B1 shows the ratio of the Mean Squared Errors (MSE) obtained by forecasting output
growth and inflation with our output gap estimate, over the MSE obtained when the CBO
estimate is used, together with the p-values of a Diebold and Mariano (1995) type test of equal
predictive ability. A value below 1 implies that, on average, our model did better. As we can
see, the two output gap measures perform about the same in forecasting output growth and
inflation. None of the differences in forecasting performance are statistically significant, with
the only exception being that our output gap estimate performs better than the CBO estimate
in predicting GDP growth at longer horizons.
To sum up, our output gap measure performs as good as, if not marginally better than, the
CBO measure in forecasting output growth, and as good as the CBO measure in forecasting
inflation.
C Robustness analysis
In this Appendix we show robustness results about model specification. Specifically, (1) we test
the robustness of our model to changes in the number of common factors q, and (2) to changes
in the number of lags in the factor loadings s and in the VAR for the common factors p. Finally,
(3) we investigate the effect of not allowing for local linear trend in GDO and a local level model
in the unemployment rate.
Figure C1 shows results when the model is estimated with q = 1, . . . , 5 common factors.
The results in Figure C1 confirms those discussed in the paper: in order to have a meaningful
estimate of the output gap it is necessary to include at least three factors in model. By contrast,
in order to have a meaningful estimate of the unemployment rate gap at least four factors should
be included. Finally, a model with five factors produces results in line with those obtained with
our benchmark specification, but simply produce slightly more volatile estimates.
Figure C2 compares results from our benchmark specification {q = 4, s = 1, p = 3}, with a
specification which allows for reacher dynamic {q = 4, s = 4, p = 4}. As we can see, adding lags
to our model has nearly no effects on our estimates, but to add a bit more volatility.
Figure C3 compares results from our benchmark specification, to those obtained without
allowing for a local linear trend in GDO and for a local level model in the unemployment rate.
As we can see, not allowing for a local linear trend in GDP produces a much higher estimate
estimate to forecast the final estimate of GDP growth and PCE price inflation, while we are using our quasi-
real-time estimate for predicting the final estimate of GDP growth and PCE price inflation.
Page IV of VII
-6
-1
-2
-8
-3
-10
-4
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
In each plot the blue line is the estimate obtained when the model with two common factors is estimated. The yellow
line is the estimate obtained when the model with three common factors is estimated, while the red line is the estimate
obtained when the model with four common factors, which is our benchmark specifications, is estimated. The green
line is the estimate obtained when the model with five common factors is estimated.
0
2
-2 1
-4 0
-6
-1
s=1, p=3 -2
-8 s=4, p=4
-3
-10
-4
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
In each plot the blue line is the estimate obtained when the model is estimated by allowing the common factors to affect
each variable with a maximum delay of five lags, and when the VAR for the common factors includes four lags. The
red line is the estimate obtained when the model is estimated by allowing the common factors to affect each variable
with a maximum delay of three lags, and when the VAR for the common factors includes three lags.
of potential growth after the financial crisis, and a lower output gap at the end of the sample.d
Moving to the unemployment rate, not allowing for a local level model produces an estimate of
the unemployment rate gap that it is essentially unchanged.
Page V of VII
0
2
-2 1
BL
no TV parameters 0
-4
-6
-1
-2
-8
-3
-10
-4
1970 1975 1980 1985 1990 1995 2000 2005 2010 2015 1970 1975 1980 1985 1990 1995 2000 2005 2010 2015
In each plot the blue line is the estimate obtained when the model is estimated without a local linear trend for GDO
and a local level model for the unemployment rate. The red line is our benchmark estimate and the red shaded area
are the 64% and 84% confidence bands.
1. Simulate the states by the simulation smoother (Durbin and Koopman, 2002):
2. Simulate the stationary residual of the model by the stationary bootstrap (Politis and
Romano, 1994).
Page VI of VII
(b)
4. using y
et run the EM algorithm to get a new estimate of the parameters ϕ
b(b) , and the
(b) (b) (b)
states f̌t , ζ̌t , Ďt
(b) (b) (b) (b) (b) (b) b (b) =
5. center the estimated states by: b
ft = b
ft − e
ft + f̌t , ζbt = ζbt − ζet + ζ̌t , and Dt
bt − D
D e (b) + Ď(b) .
t t
(b)
6. Run the TC decomposition on the estimated factors b ft to get a new estimate of the
(b) (b) b(b) .
b(b) and ψ
common trend τbt , the common cycles cbt , and the parameters ψ ⊥
d (b)
b (b) Ps b (b) 0
b d b(b) .
7. Estimate, for example, potential output as P Ot = DGDO,t + `=0 λGDO,` Φ τ t−`
(b)
d
Repeating this procedure several times gives a distribution of, say, potential output: {P Ot , b =
1, . . . 1000}. In order to compute the (1 − α) confidence interval, at each quarter t we compute
(b)
the sample variance of {P d O −P d Ot }, which we denote as σ 2 , and then we construct the (1 − α)
t t
d
confidence interval is given by [P d
Ot + zα/2 σt , P Ot + z1−α/2 σt ], where zα/2 = −z1−α/2 is the
α/2 quantile of a standard normal.
References
Diebold, F. X. and Mariano, R. S. (1995). Comparing predictive accuracy. Journal of Business
and Economic Statistics, 13:253–263.
Durbin, J. and Koopman, S. J. (2002). A simple and efficient simulation smoother for state
space time series analysis. Biometrika, 89:603–615.
Kamber, G., Morley, J., and Wong, B. (2018). Intuitive and reliable estimates of the output gap
from a Beveridge-Nelson filter. Review of Economics and Statistics, 100:550–566.
Orphanides, A. and van Norden, S. (2002). The unreliability of output-gap estimates in real
time. The Review of Economics and Statistics, 84:569–583.
Politis, D. N. and Romano, J. P. (1994). The stationary bootstrap. Journal of the American
Statistical Association, 89:1303–1313.