approaches to dealing with unobserved confounding in the dynamic treatment regime is a much less
explored area. Recent work, for instance, explores an extension of the differences-in-differences
approach to the dynamic treatment regime Shahn et al. (2022).
Synthetic controls/interventions in the dynamic treatment regime. In this work, we present the first
extension of the synthetic controls literature to the dynamic treatment regime. Synthetic controls
method Abadie and Gardeazabal (2003); Abadie et al. (2010) —and its generalization to synthetic
interventions Agarwal et al. (2020b)—is a commonly used empirical approach to dealing with
unobserved confounding from observational panel data. However, the existing literature assumes that
units are treated only once or in a non-adaptive manner. This limits the applicability of the technique
to policy relevant settings where multiple interventions occur sequentially over a period of time.
Our work proposes an extension of the synthetic controls and synthetic interventions method, that
allows the identification of mean counterfactual outcomes under any treatment sequence, even when
the observational data came from an adaptive dynamic treatment policy. Similar to the synthetic
interventions framework, our work assumes that the panel data stem from a low-rank data generation
model and that the latent factors capture the unobserved confounding signals. In the static regime, the
low rank assumption together with a technical overlap assumption allows one to express each unit’s
mean outcomes under any sequence of interventions as linear combinations of the observed outcomes
of a carefully chosen sub-group of other units. We extend this idea to the dynamic treatment regime
under a low-rank linear structural nested mean model assumption. Our work can also be viewed as
extending the g-estimation framework for structural nested mean models Robins (2004); Vansteelandt
and Joffe (2014); Lewis and Syrgkanis (2020) to handle unobserved confounding under a low-rank
structure. Thus our work helps connect the literature on synthetic controls with that of structural
nested mean models.
Overview of methodology. The key idea of our identification strategy is to express the mean outcome
for a unit under a sequence of interventions as an additive function of “blip” effects for that sequence.
The blip effect of an intervention at a given period can be thought of as the treatment effect of
that intervention compared to a baseline intervention for that specific period, assuming a common
sequence of interventions for all other time periods. Subsequently, our low-rank assumption and a
recursive argument allows us to identify the blip effect of each treatment for each unit and time period.
Our procedure can be viewed as a dynamic programming method where a synthetic control type
procedure is used to compute “synthetic blip effects” at each step of the dynamic program to identify
time period specific causal quantities, which are subsequently used to build the overall counterfactual
quantity of estimating the outcome of any unit under any sequence of interventions.
1.1 Related Work
Panel data methods in econometrics.
This is a setting where one gets repeated measurements of
multiple heterogeneous units over say
T
time steps. Prominent frameworks include differences-
in-differences Ashenfelter and Card (1984); Bertrand et al. (2004); Angrist and Pischke (2009)
and synthetic controls Abadie and Gardeazabal (2003); Abadie et al. (2010); Hsiao et al. (2012);
Doudchenko and Imbens (2016); Athey et al. (2021); Li and Bell (2017); Xu (2017); Amjad et al.
(2018, 2019); Li (2018); Arkhangelsky et al. (2020); Bai and Ng (2020); Ben-Michael et al. (2020);
Chan and Kwok (2020); Chernozhukov et al. (2020); Fern
´
andez-Val et al. (2020); Agarwal et al.
(2021b, 2020a). These frameworks estimate what would have happened to a unit that undergoes
an intervention (i.e., a “treated” unit) if it had remained under control (i.e., no intervention), in the
potential presence of unobserved confounding. That is, they estimate the counterfactual if a treated
unit remains under control for all
T
time steps. This is a restricted case of what we consider in this
paper, which is to estimate what happens to a unit under any sequence of interventions over the
T
time steps. A critical aspect that enables the methods above is the structure between units and time
under control. One elegant encoding of this structure is through a latent factor model (also known as
an interactive fixed effect model), Chamberlain (1984); Liang and Zeger (1986); Arellano and Honore
(2000); Bai (2003, 2009); Pesaran (2006); Moon and Weidner (2015, 2017). In such models, it is
posited that there exist low-dimensional latent unit and time factors that capture unit and time specific
heterogeneity, respectively, in the potential outcomes. Since the goal in these works is to estimate
outcomes under control, no structure is imposed on the potential outcomes under intervention.
In Agarwal et al. (2020b, 2021a), the authors extend this latent factor model to incorporate latent
factorization across interventions as well, which allows for identification and estimation of coun-
2