
recent nonparametric drift estimators proposals in Della Maestra and Hoffmann [13], Belomestny et al. [3] or Comte
and Genon-Catalot [9]. These kinds of models are related to physics. We rather have in mind economic or financial
models. For instance, in Duellmann et al. [16], the authors consider a portfolio of Nhomogenous firms such that the
asset value Xi
tat time tof the i-th firm is modeled by Merton’s model (see [21]) dXi
t=µXi
tdt +σXi
tdBi
twith Xi
0=X0,
which corresponds to (1) with b(x)=µxand σ(x)=σx. Intending to capture the dependency between the firms, they
also assume that dBi
t=√ρdWt+p1−ρdWi
t, where Wis a common systematic risk factor,Wiis a firm-specific risk
factor and ρ∈[0,1]. This corresponds to a particular matrix R, precisely Ri,k=ρfor i,k(and Ri,i=1), so that one
single parameter ρrepresents the so-called asset correlation. This model has been considered in e.g. Bush et al. [4],
for the more mathematical purpose of studying the limit of the empirical distribution of the Xi
t’s (see also references
therein). Our extension from specific geometric Brownian motion to general nonparametric diffusion (1), and from
one single correlation parameter to a general matrix representation, is therefore standard in both respects. This context
has nevertheless never been considered before up to our knowledge. Let us emphasize that our aim is not to estimate
R, but to exhibit conditions on it such that b(.) can be estimated with performance near of the independent setting.
In our framework, Tis fixed, and Nis large. Our results are nonasymptotic, but the idea is that Ngrows to in-
finity. We fix a subset Iof Rand build a collection of projection least squares estimators of bI=b1Iwhere Iis
compact or not. The estimators are defined by their coefficients on an orthonormal basis of L2(I), ϕ1,...,ϕm, result-
ing from a standard least squares computation. Precisely, we consider the estimator of the drift function bminimizing
the objective function γN(τ)
τ7−→ γN(τ) :=1
NT
N
X
i=1 ZT
0
τ(Xi
s)2ds −2ZT
0
τ(Xi
s)dXi
s!(3)
on the m-dimensional function space Sm=span{ϕ1,...,ϕm}. The first part of γN(τ) involves a quantity
kτk2
N:=1
NT
N
X
i=1ZT
0
τ(Xi
s)2ds,
which is considered as the squared empirical norm of the function τ. These estimators are the same as in Comte and
Genon-Catalot [7], but their study is made significantly more difficult by the dependency context. We do not have at
our disposal any coupling method nor any transformation leading to a simpler system; in particular, applying R−1/2to
the system does not bring any simplification because of a ”widespreading” of the components of the process. Tropp’s
deviation inequalities used in the independent context (see Tropp [24], Matrix Chernov Inequality, Theorem 1.1 and
Matrix Bernstein Inequality, Theorem 1.4), which allow to consider the empirical norm and its expectation (an integral
norm, thus) as equivalent with high probability, no longer apply. Martingale properties still are useful, and we turn to
Azuma’s matrix deviation inequality (see Tropp [24], Theorem 7.1), which however requires to set sparsity conditions
on R(see Assumption 3). This equivalence property between empirical and weighted integral norms is the key of the
rigorous study of the risk of the drift estimator, and the correlation matrix is therefore at the heart of the proofs.
The plan of the paper is the following. A first parametric example motivates the model and the way of estimating
a drift parameter in Section 2. The general nonparametric drift estimator is defined in Section 3 and a risk bound on
a fixed projection space is proved. Adaptive estimation is studied in Section 4 and the whole procedure is illustrated
through simulations in Section 5. Lastly, proofs are gathered in Section 6.
2. Preliminary motivation and example in the parametric framework
This preliminary section deals with the geometric model described in the introduction, in the parametric frame-
work, in order to motivate our investigations. Similarly to Duellmann et al. [16], consider Nrisky assets of same
nature and of prices processes X1, . . . , XNobserved on the time interval [0,T]. Since these assets are of same nature,
to model their prices by dXi
t=µXi
tdt +σXi
tdBi
twith µ∈Rand σ > 0 not depending on i∈ {1,...,N}is realistic, but
it is also very realistic to consider that B1,...,BNmay be dependent, through the correlation matrix described above.
2