RCS
RCS
Pedro H. C. Sant’Anna
Emory University
January 2025
Introduction
DiD procedures with Covariates
■ All these are implemented in DRDID and did R packages, and drdid and csdid Stata
packages.
1
Are there differences between these two cases?
2
We will focus on the 2x2 case
3
Let’s review our assumptions
Assumptions in 2x2 setup
Assumption (No-Anticipation)
For all units i, Yi,t (g) = Yi,t (∞) for all groups in their pre-treatment periods, i.e., for all
t < g.
4
Different Sampling Schemes
Panel data sampling scheme
■ Assumption states that we observe the same units in all time periods:
No need to worry about compositional changes!
5
Stationary Repeated cross-section data sampling scheme
d
Furthermore, (G, X) |T = 1 ∼ (G, X) |T = 2.
6
Stationary Repeated Cross-Section Data Sampling Schemes
■ RCS results of Abadie (2005) and Sant’Anna and Zhao (2020) really depends on this!
7
What if I want to allow for compositional changes?
8
Repeated cross-section data sampling scheme
■ Not many results are available for this case in the literature.
■ In Sant’Anna and Xu (2023), we have worked out the details on how to allow
compositional changes while doing DiD.
9
Repeated cross-section data sampling scheme
■ Not many results are available for this case in the literature.
■ In Sant’Anna and Xu (2023), we have worked out the details on how to allow
compositional changes while doing DiD.
▶ Derive the semiparametric efficiency bound for this case;
▶ Derive the semiparametric efficiency bound for cases where part of the data is a
balanced panel and another part is repeated cross-sectional (like in CPS).
■ We won’t have time to dig into these details today, as these results are very recent—I
may update these slides in the future.
10
Difference 1:
Most DiD estimators with RCS data impose a no-compositional
changes assumption.
This is not the case when panel data is available.
11
Doubly Robust estimators
Doubly Robust DiD procedure with Panel
■ Sant’Anna and Zhao (2020) considered the following doubly robust estimand when
panel data are available:
p(X) (1 − D)
D 1 − p(X)
ATTdr,p
= E − ∆Y − m∆ (X)
G= ∞
E [D] p(X) (1 − D) ,
E
1 − p(X)
where D = 1{G = 2}.
■ Note that we only need to model the evolution of Y given X for the untreated units.
■ Estimator can achieve the semiparametric efficiency bound (even without modelling
the out. evol. among treated). 12
Doubly Robust DiD procedure with stationary repeated cross-section
■ Sant’Anna and Zhao (2020) two different DR estimands with RCS data.
■ First one mimics the panel data one:
ATTdr
1
,rc
=
D · 1 {T = 2} D · 1 {T = 1}
E − · Y − mrc rc
G=∞,t=2 (X) − mG=∞,t=1 (X)
E [D · 1 {T = 2}] E [D · 1 {T = 1}]
p(X)(1−D)·1{T=2} p(X)(1−D)·1{T=1}
1−p(X) 1−p(X)
− E h i − h i · Y − mrc rc
G=∞,t=2 (X) − mG=∞,t=1 (X)
p(X)(1−D)·1{T=2} p(X)(1−D)·1{T=1}
E 1−p(X)
E 1−p(X)
14
Doubly Robust DiD procedure with repeated cross-section
Sant’Anna and Zhao (2020) second DR DiD estimand also relies on outcome regression
models for the treated unit:
ATTdr,rc
2 = ATT1dr,rc
rc
+ E mrc rc rc
G=2,t=2 (X) − mG=∞,t=2 (X) D = 1 − E mG=2,t=2 (X) − mG=∞,t=2 (X) D = 1, T = 2
rc
− E mrc rc rc
G=2,t=1 (X) − mG=∞,t=1 (X) D = 1 − E mG=2,t=1 (X) − mG=∞,t=1 (X) D = 1, T = 1 ,
■ Both DR DiD estimators for RCS data are consistent for the ATT under the exact same
conditions:
■ Even if the regression model for the outcome evolution for the treated group is
misspecified, ATTdr,rc
2 is consistent for the ATT (provided that either the pscore or the
regression models for out. evol. among untreated units are correctly specified).
dr,rc
■ However, in general, ATT2 is more efficient than ATTdr,rc
1 .
dr,rc
■ In fact, Sant’Anna and Zhao (2020) shows that ATT2 is (locally) semiparametrically
efficient.
16
Difference 2:
We need to model the outcome evol. among treated units in RCS if
we want to achieve semiparametric efficiency bound!
17
Comparing Semiparametric efficiency bounds
Semiparametric efficiency: Panel vs. Repeated cross-section data
Corollary
Assume that T is independent of (Y1 , Y0 , D, X), and the other regularity conditions stated
in Sant’Anna and Zhao (2020) hold. Then,
p,2
Vrc,2
eff − Veff
r r !2
1 1−λ λ
= E D (Yt=2 − mG=2,t=2 (X)) + (Yt=1 − mG=2,t=1 (X))
E [D]2 λ 1−λ
r r !2
2
(1 − D) p (X) 1−λ λ
+ (Yt=2 − mG=∞,t=2 (X)) + (Yt=1 − mG=∞,t=1 (X)) ≥ 0,
(1 − p (X))2 λ 1−λ
where λ = P (T = 2)
18
Semiparametric efficiency: Panel vs. Repeated cross-section data
■ However, it is, in general, not feasible to know the optimal λ during the design stage:
σ̃22 depends on post-treatment data!
■ λ = 0.5 is a “reasonable” choice, in practice.
19
Difference 3:
The best RCS DiD estimator is always less efficient than the best
Panel DiD estimator for a given sample size.
20
Take-way messages
Take-way message
▶ Check Sant’Anna and Xu (2023) for how to make your model more flexible and test for
compositional changes.
■ With RCS, there are important benefits of modeling the outcome evolution of treated
units when doing DiD.
■ Overall, for a given sample size, RCS is less efficient than Panel data.
Loss of efficiency is bigger when the pre and post-treatment sample sizes are more
imbalanced.
■ See Sant’Anna and Zhao (2020) and Sant’Anna and Xu (2023) for more details!
21
References
Abadie, Alberto, “Semiparametric Difference-in-Differences Estimators,” The Review of
Economic Studies, 2005, 72 (1), 1–19.
Sant’Anna, Pedro H. C. and Qi Xu, “Difference-in-Differences with Compositional Changes,”
arXiv:2304.13925, 2023.
Sant’Anna, Pedro H. C. and Jun Zhao, “Doubly robust difference-in-differences estimators,”
Journal of Econometrics, November 2020, 219 (1), 101–122.
21