
individual segment. In prior work [12], [13] the authors
showed how to append a specific terminal reward to the short
horizon optimization problem used for each path segment
such that the reward of the overall receding horizon path is
guaranteed to exceed a desirable lower bound. This result was
demonstrated for the explicit reward function associated with
a robotic search problem in a discrete environment through
numerical experiments. In this work, we demonstrate the
utility of adding a terminal reward to a different class of
reward function based on level set estimation using Gaussian
process models. We report on the results of challenging real-
world experiments accomplished using a real autonomous
underwater vehicle (AUV) performing on-line planning using
live data.
Throughout this work, we seek to rapidly map the 15 meter
isobath within a bounded area of Claytor lake in Dublin,
VA, USA. The specific region of Claytor lake where the
experiments of this work are performed is shown in Figure
4 with the solid red lines indicating the boundaries of the
operational area. The AUV used for these experiments is the
Virginia Tech 690 vehicle shown in Figure 1.
The remainder of this paper is organized as follows.
Gaussian process regression and the specific considerations
and methods used in this work are presented in Section II.
The specific reward function used for evaluating paths is
presented in Section III. The path planning methods proposed
in [13] and used in this work are presented and discussed
in Section IV. The usage and relevant capabilities of the
690 vehicle are discussed in Section V. The experimental
procedure and selection of parameters is presented in Section
VI. Results and conclusions are presented in Section VII.
II. GAUSSIAN PROCESS REGRESSION
In this section we provide a brief review of Gaussian
process regression. We refer the reader to [14, Chapter 2]
for a more thorough discussion of the topic.
A Gaussian process is a collection of random variables,
any finite number of which have a joint Gaussian distribution
[14, Definition 2.1]. A GP is completely specified by its mean
and covariance functions m(p)and k(p, p0)respectively and
is written
f(p)∼ GP(m(p), k(p, p0)) (1)
where f(p)is a real process. In this work, we assume that the
bathymetry or depth of Claytor lake within the operational
area may be modeled using a GP.
A sample or datum d= (p, z)is an input-measurement
pair with p∈Pand z∈R. Let Dtbe the set of all
samples acquired up to time t. For any input pi, an associated
measurement ziis related to pivia
zi=f(pi) + i,for i= 1,...,|Dt|(2)
where |D| denotes the cardinality of the set Dand i∼
N(0, σ2
n)is drawn from a zero mean Gaussian distribution
with standard deviation σn.
Given the data set Dt, one may wish to predict f(p∗)for
an arbitrary input p∗. From [14, Chapter 2.7], the posterior
predictive distribution of f(p∗)conditioned on Dtis Gaus-
sian and specified by the mean µp∗|Dtand variance σ2
p∗|Dt
given as
µp∗|Dt=m(p∗) + k(p∗)>(Kt+σ2
nI)−1(Zt−m(Pt)) (3)
σ2
p∗|Dt=k(p∗, p∗)−k(p∗)>(Kt+σ2
nI)−1k(p∗).(4)
Zt= [z1, . . . , z|Dt|]>is the vector of all measurements
contained in Dtand Pt={p1, . . . , p|Dt|}is the set of as-
sociated inputs. The prior mean vector is [m(Pt)]i=m(pi),
[k(p∗)]i=k(p∗, pi),[Kt]i,j =k(pi, pj), and Itis the
|Dt| × |Dt|identity matrix.
When computing the predictive distribution at p∗, the
matrix inversion (Kt+σ2
nI)−1can become intractable as |Dt|
grows large. To address this complexity, we utilize a distance
metric d(p, p0)on Pto reduce the number of samples
retained in Dtand make predictions using local Gaussian
processes. To reduce the number of samples retained in
Dt, a new sample dc= (pc, zc)is only added to Dtif
d(pc, pi)≥δffor all pi∈ Dt. For a predictive point p∗,
the associated data is limited to a locality around p∗giving
the local data set as Dp∗,t ={di∈ Dt|d(pi, p∗)≤δc}. The
local predictive mean and variance are given by equations
(3) and (4) using Dp∗,t in the place of Dt.
Throughout this work, we define the input space Pto be
two dimensional euclidean space (R2) with distance metric
d(p, p0) = kp−p0kbeing the euclidean distance. We use the
squared exponential kernel function
k(p, p0) = σ2
fexp kp−p0k2
2l2
c(5)
as the prior for the covariance function. The prior mean
function used in this work is presented in Section VI.
III. OBJECTIVE FUNCTION
We wish to determine the level set L(l∗) = {p∈
P|f(p) = l∗}. To direct the selection of locations that
should be measured in order to obtain information regarding
this level set, we require an objective function that encodes
uncertainty regarding whether p∗is an element of L(l∗). To
provide such a function, we use a measure of ambiguity
presented in [15] which is closely related to the straddle
heuristic presented in [16].
Definition 1 (Ambiguity): The ambiguity associated with a
point p∗with respect to a level lis
ap∗|Dt=βσp∗|Dt− |µp∗|Dt−l|.(6)
Concretely, if β= 2 and ap∗|Dt<0, then the model indicates
that the probability that f(p∗)falls on the opposite side of l
as µp∗|Dtis less than 2.5 percent.
Similar to [15] where batches of points are selected for
sampling, we seek to select a path or a sequence of points to
sample. Considering the ambiguity of sets of points requires
some slight modifications to account for locations that will
be sampled while no samples have yet been acquired.
Definition 2 (Anticipated Ambiguity): Let P∗=
{p∗
0, . . . , p∗
n}and P∗
i={p∗
0, . . . , p∗
i−1}with P∗
0=∅. Then