Liao (2020), including (i),(ii),(iv), and applicability to general
ℓp
-norms. In addition, we correct a
slight technical inaccuracy in their proof relating to the derivation of bounds in probability (see
Remark 2.1).
Belloni and Oliveira (2018) did not establish a Yurinskii coupling for martingales, but rather a
central limit theorem for smooth functions of high-dimensional martingales using the celebrated
second-order Lindeberg method (see Chatterjee,2006, and references therein), explicitly accounting
for covariance degeneracy. As a consequence, their result could be leveraged to deduce a Yurinskii
coupling for martingales with additional, non-trivial technical work (see Appendix Bfor details).
Nevertheless, a Yurinskii coupling derived from Belloni and Oliveira (2018) would not feature (i),(ii),
(iv), or general
ℓp
-norms, as our results do. We discuss further the connections between our work
and the related literature in the upcoming sections, both when introducing our main theoretical
results and when presenting examples and statistical applications.
The most general coupling result of this paper (Theorem 2.1) is presented in Section 2, where
we also specialize it to a slightly weaker yet more user-friendly formulation (Proposition 2.1). Our
Yurinskii coupling for approximate martingales is a strict generalization of all previous Yurinskii cou-
plings available in the literature, offering a Gaussian mixture strong approximation for approximate
martingale vectors in
ℓp
-norm, with an improved rate of approximation when the third moments of
the data are negligible, making no assumptions on the spectrum of the data covariance matrix. A
key technical innovation underlying the proof of Theorem 2.1 is that we explicitly account for the
possibility that the minimum eigenvalue of the variance may be zero, or that its lower bound may
be unknown, with the argument proceeding using a carefully tailored regularization. Establishing a
coupling to a Gaussian mixture distribution is achieved by an appropriate conditioning argument,
leveraging a conditional version of Strassen’s theorem (Chen and Kato,2020, Theorem B.2; Monrad
and Philipp,1991, Theorem 4), along with some related technical work detailed in Appendix B. A
third-order coupling is obtained via a modification of a standard smoothing technique for Borel sets
from classical versions of Yurinskii’s coupling (see Lemma B.2 in the appendix), enabling improved
approximation errors whenever third moments are negligible.
In Proposition 2.1, we explicitly tune the parameters of the aforementioned regularization
to obtain a simpler, parameter-free version of Yurinskii’s coupling for approximate martingales,
again offering Gaussian mixture coupling distributions and an improved third-order approximation.
This specialization of our main result takes an agnostic approach to potential singularities in the
data covariance matrix and, as such, may be improved in specific applications where additional
knowledge of the covariance structure is available. Section 2also presents some further refinements
when additional structure is imposed, deriving Yurinskii couplings for mixingales, martingales, and
independent data as Corollaries 2.1,2.2, and 2.3, respectively. We take the opportunity to discuss
and correct in Remark 2.1 a technical issue which is often neglected (Pollard,2002;Li and Liao,
2020) when using Yurinskii’s coupling to derive bounds in probability. Section 2.5 presents a stylized
example portraying the relevance of our main technical results in the context of canonical factor
models, illustrating the importance of each of our new Yurinskii coupling features (i)–(iv).
Section 3considers a substantive application of our main results: strong approximation of
martingale empirical processes. We begin with the motivating example of canonical kernel density
estimation, demonstrating how Yurinskii’s coupling can be applied, and showing in Lemma 3.1 why
it is essential that we do not place any conditions on the minimum eigenvalue of the variance matrix
(iii). We then present a general-purpose strong approximation for martingale empirical processes
in Proposition 3.1, combining classical results in the empirical process literature (van der Vaart
and Wellner,1996) with our coupling from Corollary 2.2. This statement appears to be the first of
its kind for martingale data, and when specialized to independent (and not necessarily identically
distributed) data, it is shown to be superior to the best known comparable strong approximation
3