adaptive personalized strategies for assigning treatment, thus contributing to the
goals of precision health. Critically, by design, SMARTs allow the effects of these
embedded regimes (and others, such as optimal dynamic treatment regimes based
on covariates beyond those that define the trial design; [1]) to be identified and
estimated without risk of bias.
SMART designs are increasingly growing in popularity. For example, a re-
cent review by [2] cites 24 SMART protocol papers published since 2014. While
primary analyses for SMARTs sometimes aim to examine the single timepoint
static effects of the treatment options in the SMART’s nested trials, they increas-
ingly (in either primary or secondary aims) aim to evaluate the effects of embed-
ded regimes (e.g., [3, 4]) or additionally tailored individual interventions (e.g.,
[5]). When evaluating the SMART’s embedded regimes, common approaches for
estimating the expected counterfactual outcome (or “value”) of a given embed-
ded regime use inverse probability weighting (IPW) estimators, including weight-
ing and replicating approaches (introduced in [6, 7, 8]; see also [9]) and G-
computation approaches (introduced in [10, 11, 12, 13]). IPW estimators, and
some G-computation estimators (depending on how the sequential regressions are
estimated) will generally provide unbiased estimates of the value of the embed-
ded regime; however, they are inefficient in that they do not make full use of
baseline and time-updated covariates to improve estimator precision. Advances
in semiparametric efficient substitution estimators, such as longitudinal targeted
maximum likelihood estimation (TMLE), allow for the integration of machine
learning in the estimation process, enabling more precise estimates while retaining
valid inference (see [14] for a review in the context of SMARTs). Recent work has
documented the potential of flexible covariate adjustment using machine learning,
and TMLE in particular, to improve precision in single timepoint individually ran-
domized trials (eg, [15]) and cluster randomized trials (eg, [16]). Simulations used
to inform the design of SMARTs (see, e.g., [14, 17]) further support the poten-
tial benefits of longitudinal TMLE for the primary analysis of embedded regimes
in SMART studies. However, to the best of our knowledge, neither longitudinal
TMLE nor other semiparametric efficient estimators have been implemented or
reported as the primary analysis method of a published SMART.
In this paper we first review, using the “Causal Roadmap” [18], how SMART
designs can be used to identify the effects of embedded regimes, including the
expected counterfactual outcome (or value) of each regime had all participants in
the population followed it. We then describe an efficient and robust approach to
estimating these counterfactual quantities without reliance on model assumptions,
beyond the assumption of sequential randomization known by design. Specifi-
cally, we describe a longitudinal TMLE [19, 20] for estimating the values of these
embedded regimes. TMLE is a double robust, semi-parametric, efficient, plug-in
estimator that incorporates machine learning to improve efficiency without sacri-
3