
APREPRINT - OCTOBER 7, 2022
variable. Furthermore, there is a non-negligible data loss in regards to the temporal component. First of all, only data
in the pre-treatment period is used to fit the model and find the optimal weights of the counterfactual unit. Second, time
series evolution and interaction over time is neglected, as data is aggregated over time or treated individually for each
time period. An alternative class of models is identified by Brodersen et al. (2014), whose approach addresses many
of the previous methods limitations. The authors approach relies on Bayesian state space models which encompass
the outcome’s temporal evolution with exogenous regression components to efficiently build a counterfactual model.
State space models allow for flexibility when modelling a variable that is affected by external noise, distinguishing
between a state equation which describe the transition of the latent variable from one point in time to the next one,
and a measurement equation, which describes the accuracy of the signal1. Being fully Bayesian makes it possible
to (i) incorporate prior information about the model structures and parameters and (ii) have a posterior distribution,
and thus a probabilistic uncertainty quantification of the causal impact of the intervention. Although the models focus
on one outcome variable and multiple controls, an extension to a multivariate setting has been implemented using
Multivariate Bayesian Structural Time Series (Menchetti and Bojinov, 2020), which is limited to linear relationships
between outcomes and controls and subject to the Markovian assumption of the variables.
2.2 Assumptions
In this subsection, we set up the framework to estimate the causal effect of an intervention on the treated subject. Each
subject yi,t, i.e. each country in our application, is associated with a binary potential outcome yi,t(wi,t)∈Rwhere
wi,t ∈ {0,1}is a treatment assignment indicator with ‘1’ referring to the variable being treated (the UK) and ‘0’ to
the controls (other European countries). Furthermore, define w1:m,1:T={w1,1:T,··· ,wm,1:T}as the assignment
path up to time Tof all units i= 1, ..., m and denote w1:m,1:Ta realization of this path. As in Menchetti et al.
(2021), throughout the paper we make a set of assumptions to guarantee that the differences in the potential outcome
trajectories are a direct statistical consequence of the intervention. Since some of them can not be directly tested, we
rely on the concept of plausibility in our empirical settings. In particular, we assume the following:
Assumption 1 (Single intervention) Unit ireceived a single intervention if there exist a t0∈ {1, ..., T }such that
wi,t = 0 for all t<t0and wi,t =wi,s =wifor all t, s > t0.
This says that the treatment is single, i.e. it occur at one point in time, and is persistent, i.e. has no disruptions. Then,
we can ease the notation and drop the tsubscript for t > t0
Assumption 2 (Temporal no-interference) For all for all i∈ {1, ..., m}and all t>t0, the outcome of unit iat time
tdepends only on its own treatment path
yi,t(w1:m,t0+1:T) = yi,t(wi,t0+1:T)
If it holds, one can drop also the subscript ifrom wias this assumption asserts that whether or not other units receive
the treatment at time t0, this has no impact on other units’ potential outcome. Units do not interfere with each other
at any point in time. This is the time series equivalent of the cross-sectional Stable Unit Treatment Value Assumption
(SUTVA) Rubin (1974) also known as Temporal SUVTVA from the work of Bojinov and Shephard (2019). In our
1Formally, dropping the subscript i,yt=Z
0
αt+γxt+trepresent the observation equation and αt+1 = Φαt+Rηtis the
state equation, where t∼ N (0, σ2
)and ηt∼ N (0, σ2
). See Brodersen et al. (2014) for more details
3