assumptions made in the literature (Cui and Tchetgen Tchetgen,2021b;Qiu et al.,2021a). In cases
where these assumptions are questionable, alternative assumptions such as the assumption A (Han,
2021a), the assumption made in Theorem 3.1 (Cui and Tchetgen Tchetgen,2021a), and the lower
bound of the value function (Cui and Tchetgen Tchetgen,2021b;Han,2021b;Pu and Zhang,2021;
Chen and Zhang,2021) may be employed.
The prostate, lung, colorectal, and ovarian (PLCO) cancer screening trial (Team et al.,2000), a
two-arm randomized trial examining screening tests for PLCO cancers, inspires this work. Between
November 1993 and July 2001, ten centers recruited participants across the U.S. and collected the
data up to 2015. We will focus on determining the optimal assignment of flexible sigmoidoscopy
screening (a screening test for colorectal cancer) to increase life expectancy. It is important to per-
sonalize the flexible sigmoidoscopy screening as not all individuals will experience the desired benefit
(Tang et al.,2015). Additionally, there is non-adherence to the screening in the trial, which could be
influenced by a range of unmeasured confounding factors (Lee et al.,2021;Kianian et al.,2021). For
example, relatively healthy individuals may be more likely to skip the screening. When unmeasured
confounders are present, the estimated optimal treatment regimes based on the assumption of their
absence become unreliable. An IV that effectively compensates for the bias caused by unmeasured
confounding effects can be employed to address this issue. In our case, the assigned treatment serves
as a suitable IV.
Our study extends the aforementioned literature on survival contexts by adapting and generalizing
the value search method (Jiang et al.,2017;Zhou et al.,2022;Zhang et al.,2012), which derives the
optimal treatment regime by maximizing a consistent estimator of the value function under a class of
regimes. Specifically, we introduce a novel inverse-weighted Kaplan-Meier estimator (IWKME) and a
corresponding semiparametric estimator for the optimal treatment regime that permit the utilization
of an IV to overcome unmeasured confounding. Additionally, we propose their doubly robust versions
to enhance resistance to model misspecification. The contributions of this article are summarized as
follows. First, we propose novel estimators to identify the treatment effects and assign treatments
for the time-to-event data, which go beyond specifying a Cox proportional hazards model or other
semiparametric models. Compared with the estimators in Cui and Tchetgen Tchetgen (2021b) who
considered the non-censored data, we add an extra term to ensure that our doubly robust estimator
remains consistent even when the conditional survival function of the censoring time is misspecified,
which distinguishes our estimator from theirs. As far as we know, it is the first doubly robust
estimator to identify the treatment effects using an IV in survival contexts. Previous doubly robust
estimators are based on the no the unmeasured confounding assumption (Bai et al.,2017) or focus on
the treatment effect among compliers (Lee et al.,2021). Second, we consider the smoothing approach
to improving the performances of the estimators. The value function is non-smoothed with respect to
the parameters of the treatment regimes, which yields computational challenges in the optimization.
The kernel smoothing technique solves the challenges efficiently and maintains consistency as our
3