A unified analysis of regression adjustment in randomized experiments Katarzyna RelugaTing Yeand Qingyuan Zhao

2025-04-27 0 0 478.43KB 17 页 10玖币
侵权投诉
A unified analysis of regression adjustment in
randomized experiments
Katarzyna Reluga,Ting Yeand Qingyuan Zhao
Abstract
Regression adjustment is broadly applied in randomized trials under the premise
that it usually improves the precision of a treatment effect estimator. However, pre-
vious work has shown that this is not always true. To further understand this phe-
nomenon, we develop a unified comparison of the asymptotic variance of a class of
linear regression-adjusted estimators. Our analysis is based on the classical theory
for linear regression with heteroscedastic errors and thus does not assume that the
postulated linear model is correct. For a completely randomized binary treatment,
we provide sufficient conditions under which some regression-adjusted estimators are
guaranteed to be more asymptotically efficient than others. We explore other settings
such as general treatment assignment mechanisms and generalized linear models, and
find that the variance dominance phenomenon no longer occurs.
Keywords: Average treatment effect; Randomized controlled trials; Covariate adjustment;
Heteroscedasticity.
Division of Biostatistics, School of Public Health, University of California, Berkeley, U.S.A. E-mail:
katarzyna.reluga@berkeley.edu.
Department of Biostatistics, University of Washington, 3980 15th Avenue NE, Box 351617, Seattle,
WA 98195, U.S.A. E-mail: tingye1@uw.edu.
Department of Pure Mathematics and Mathematical Statistics, University of Cambridge,
Centre for Mathematical Sciences, Wilberforce Road, Cambridge CB3 0WB, U.K. E-mail:
qyzhao@statslab.cam.ac.uk.
The authors gratefully acknowledge support from the Swiss National Science Foundation for the project
P2GEP2-195898.
1
arXiv:2210.04360v1 [stat.ME] 9 Oct 2022
1 Introduction
Randomized experiments are the gold standard to answer questions about causality. Many
researchers use multiple linear regression with a treatment indicator and some baseline
covariates to analyze randomized experiments, in which the treatment coefficient is often
interpreted as a causal effect. In some fields, this is known as the “analysis of covariance”
(ANCOVA), which was first proposed by Fisher (1932) to unify “two very widely appli-
cable procedures known as regression and analysis of variance”. This common practice is
motivated by the belief that regression adjustments can increase precision if covariates in
the regression are predictive of the outcome.
However, as pointed out by many authors, this is not always true especially when there
is a lot of treatment effect heterogeneity. Regression adjustment in randomized experi-
ments has been studied in two different frameworks, namely the finite-population potential
outcome model (Neyman, 1923; Rubin, 1974) and the super-population model that as-
sumes the experimental units are drawn independently from an infinite population (see e.g.
Imbens and Rubin, 2015, Chapter 7). Three estimators have been extensively studied in
the literature: the simple difference-in-means or analysis of variance (ANOVA) estimator;
the ANCOVA estimator that includes covariate main effects; and the regression-adjusted
estimator that includes covariate main effects and all treatment-covariate interactions. The
last one is termed as the analysis of heterogeneous covariance (ANHECOVA) estimator by
Ye et al. (2022). The main conclusions about the asymptotic efficiency of these estimators
are the same, regardless of whether the potential outcome model (Freedman, 2008a,b; Scho-
chet, 2010; Lin, 2013; Guo and Basse, 2021) or super-population model (Koch et al., 1998;
Yang and Tsiatis, 2001; Tsiatis et al., 2008; Schochet, 2010; Rubin and van der Laan, 2011;
Ye et al., 2022) is used. Consider two estimators ˆ
β1and ˆ
β2that converge to the same limit.
We say that ˆ
β1(asymptotically) uniformly dominates ˆ
β2if the (asymptotic) variance of ˆ
β1
is always smaller or equal than that of ˆ
β2, no matter what the underlying distribution is. In
both the potential outcome model and the super-population model, it has been found that
ANHECOVA uniformly dominates the other two, but, somewhat surprisingly, ANCOVA
does not uniformly dominate ANOVA.
A major limitation of the existing analysis of regression adjustment is that the investiga-
tions are restricted to specific estimators and provides limited insights into the phenomenon
of uniform dominance. The variance calculations are often quite technical, which further
make the theoretical results less accessible to practitioners. Furthermore, the existing lit-
erature does not tell us whether including all treatment-covariate interactions is preferred
in other cases such as stratified experiments and generalized linear models.
In this article, we provide a unified analysis for a large class of linear-regression adjusted
estimators. Besides the estimators mentioned above, our theory also applies to regression
2
estimators with some coefficients fixed (such as the difference-in-differences estimator) or
with treatment-covariate interactions only. By a simple application of the textbook theory
for linear regression with heteroscedastic errors, this analysis not only recovers the known
relationships between ANOVA, ANCOVA, and ANHECOVA, but also immediately pro-
vides a sufficient condition for uniform dominance when the expectation of the covariates
is known (see Theorem 1 below). In the more practical situation when the covariate ex-
pectation is unknown, a slightly different sufficient condition is obtained (see Theorem 2
below). This condition shows that, for example, the so-called lagged-dependent-variable
regression estimator is more efficient than the difference-in-differences estimator in random-
ized experiments, despite them having a bracketing relationship in observational studies
(Ding and Li, 2019). This unified analysis allows us to explore whether the uniform domi-
nance extends to more complicated settings and provide numerical counterexamples. Some
further remarks are provided at the end of this article, whereas proofs of the technical
Lemmas can be found in Appendix A.
2 Linear regression adjustment in randomized trials
Consider a random sample {(Ai, Xi, Yi)}n
i=1 of nunits, where Ai∈ {0,1}is a binary
treatment indicator, Xi= (Xi1, Xi2, . . . , Xip)TRpis a vector of unit covariates observed
before treatment assignment, and YiRis a real-valued outcome of the unit. We assume
that (Ai, XT
i, Yi), i = 1, . . . , n is independent and identically distributed, which is often a
good approximation when the units are randomly sampled from a large population. To
simplify the notation, we drop the subscript iwhen referring to a generic unit from the
population.
Unless mentioned otherwise, we assume that each unit receives the treatment indepen-
dently with equal probability pr(A= 1 |X) = π, where 0< π < 1is a known constant. In
other words, treatment is assigned by a simple Bernoulli trial, which approximates random
sampling without replacement that is often studied in the finite-population model (Freed-
man, 2008a,b; Lin, 2013). Under this assignment mechanism and standard assumptions in
causal inference, the average treatment effect βATE =E[Y(1) Y(0)], where Y(a)is the
potential outcome of unit iunder treatment level a, can be identified as (see e.g. Imbens
and Rubin, 2015, Chapter 7):
βATE =E(Y|A= 1) E(Y|A= 0).(1)
In this article, we consider the following class of regression adjusted estimators of βATE.
Let Γ=Γ(1) × ··· × Γ(p)Rpand ∆=∆(1) × ··· × (p)Rpbe two user-specified sets,
3
where the individual components Γ(j)and (j)are either the real line Ror a singleton.
Define the constrained ordinary least squares estimator as
ˆ
θ= (ˆα, ˆ
β, ˆγ, ˆ
δ) = arg min
γΓ
1
n
n
X
i=1 {YiαβAiγTXiAi(δTXi)}2.(2)
We sometimes use the notation ˆ
θ,∆) (and similarly for the components of ˆ
θ) to emphasize
the dependence of the estimator on the sets Γand . Lemma 1 in Section 3.1 shows that ˆ
β
is a reasonable estimator of βATE when the covariates are centered, i.e. E(X) = 0; otherwise
βATE can be estimated by ˜
β=ˆ
β+ˆ
δT¯
X, where ¯
X=Pn
i=1 Xi/n. Before examining the
asymptotic properties of ˆ
βand ˜
β, we give several examples in the class of estimators (2).
Example 1. The ANOVA, ANCOVA, ANHECOVA estimators correspond to setting Γ =
∆ = {0};Γ = Rpand ∆ = {0};Γ = Rpand ∆ = Rp.
Example 2. In some applications, the covariate vector Xinclude the baseline value of
the response before the treatment is assigned (let us call it Y0). For simplicity, suppose
the first entry of Xis Y0, so X= (X1=Y0, X2, . . . , Xp)T. The difference-in-differences
estimator corresponds to setting Γ = {1} × Rp1and ∆ = {0} × Rp1, while the lagged-
dependent-variable regression estimator corresponds to setting ΓRpand ∆ = {0}×Rp1.
In observational studies, these two estimators rely on different identification assumptions
(Ding and Li, 2019) and may converge to different limits. In the randomized experiment
described above, both estimators should converge to the average treatment effect, but we are
unaware of any comparison of their statistical efficiency in presence of covariates besides Y0.
3 A unified analysis of linear regression-adjusted esti-
mators
3.1 Covariates with known expectation
We first consider estimation of βATE when the covariates Xhave known expectation. As
will be seen in a moment, the proof of uniform dominance is fairly straightforward in this
case.
Consider the population counterpart to (2):
θ= (α, β, γ, δ) = arg min
γΓ
E{YαβA γTXA(δTX)}2.(3)
Clearly, θ=θ,∆), and we often suppress the dependence of θon ,∆) if it is clear from
the context.
4
摘要:

AuniedanalysisofregressionadjustmentinrandomizedexperimentsKatarzynaReluga,*TingYe„andQingyuanZhao…AbstractRegressionadjustmentisbroadlyappliedinrandomizedtrialsunderthepremisethatitusuallyimprovestheprecisionofatreatmenteectestimator.However,pre-viousworkhasshownthatthisisnotalwaystrue.Tofurtheru...

展开>> 收起<<
A unified analysis of regression adjustment in randomized experiments Katarzyna RelugaTing Yeand Qingyuan Zhao.pdf

共17页,预览4页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:17 页 大小:478.43KB 格式:PDF 时间:2025-04-27

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 17
客服
关注