1 Introduction
To assess the effect of a new treatment regimen (Z= 1) over a standard (or control) treatment (Z= 0) based on
data from an observational study, using causal identification a number of assumptions must be made, including
the positivity assumption. For instance, to estimate the average treatment effect (ATE), this assumption requires
0< e(x)<1, where e(x) = P(Z= 1|X=x) is the propensity score (PS), i.e., the probability of treatment
assignment, given the vector of baseline covariates X(Rosenbaum and Rubin,1983;Rubin,1997). The positivity
assumption ensures that the distributions of the related baseline covariates have a good overlap and hence a good
common support (Petersen et al.,2012;Li et al.,2018b).
The inverse probability weighting (IPW) estimator for ATE assigns to study participants weights that are
inversely proportional to their respective PSs. Thus, IPW creates a pseudo-population of participants, corrects for
observed covariates distributions imbalances between the treatment groups, and adjusts for (measured) confound-
ing bias inherent to most non-randomized studies. Nevertheless, when PSs are equal to (or near) 0 or 1, there is
violation (or near violation) of the positivity assumption, which we often refer to as lack of adequate positivity
(Petersen et al.,2012). Violations (or near violations) of the positivity assumption occur either at random (or
stochastically), i.e., by chance due the data (or underlying model) characteristics or when some subgroups of
participants can never (or barely) receive one of the treatment options under study. This can lead to moderate or
even poor overlap of the distributions of the PSs and may result in large IPW weights, especially when the ratio
[e(x)(1 −e(x))]−1is highly variable (Li and Greene,2013;Zhou et al.,2020b). As such, IPW may put a large
amount of weights on a small number of observation, which can unduly influence the estimation of the treatment
effect.
While violations of the positivity assumption can be remedied by either PS trimming or truncation, recent
advancements have introduced methods that aim to overcome the limitations of these ad hoc solutions. Some of
these novel methods propose bias-corrected estimators (Chaudhuri and Hill,2014;Ma and Wang,2020;Sasaki
and Ura,2022), while other reparametrize the PS estimation, by adding a priori covariate balancing constraints
to modify the PS model (Graham et al.,2012;Imai and Ratkovic,2014). Some consider direct optimization
techniques to derive sample weights under covariate constraints (Hainmueller,2012;Zubizarreta,2015;Wong and
Chan,2017;Hirshberg and Zubizarreta,2017) or redefine the target population altogether and bypass the need
to account for the lack of positivity (Li et al.,2018a;Matsouaka and Zhou,2020;Zhou et al.,2020b).
1.1 The positivity assumption and propensity score weighting methods
The literature defines two specific violations of the positivity assumption: random (i.e., by chance) and structural
violations (Westreich,2019;Petersen et al.,2012). Random (or stochastic) violation of the positivity assumptions
arise by happenstance, e.g., when the sample size is small or the PS model is misspecified. In such cases, increased
sample size, bias-corrected IPW trimming, PS reparameterization or direct optimization offer better alternatives
to estimate ATE (Chaudhuri and Hill,2014;Ma and Wang,2020;Sasaki and Ura,2022). Alternatively, methods
for equipoise treatment effect, i.e., the overlap weight (OW), matching weight (MW), and Shannon’s entropy
weight (EW) estimators (Matsouaka and Zhou,2020;Li et al.,2018b), can also be considered. These estimators
target treatment effects defined within the subgroup of participants for whom treatment equipoise exists.
2