
(added to) a given team in order to invalidate (satisfy) the
given formula. Such a robustness metric is discontinuous
and cannot represent how strongly each task is satisfied.
In this paper, for the new logic CaTL+, we define two
new quantitative semantics, called traditional robustness and
exponential robustness. Both are continuous with respect to
their inputs and can measure how strongly a task is satisfied.
A higher robustness indicates more agents reach the region
and stay closer to the center for a longer time. Similar to
the traditional robustness of STL [17], the new traditional
robustness only takes into account the most satisfaction or
violation point, and ignores all other points, which is called
masking. The proposed exponential robustness eliminates
masking by considering all subformulas, all time points
and all agents, which makes the control synthesis from
CaTL+ easier and the result more robust. This new definition
also includes a novel robustness for STL, which has the
strongest mask-eliminating property in promise of soundness
compared with existing STL robustness metrics.
We also formulate and solve a centralized control synthesis
problem from CaTL+. Control synthesis for cLTL [12],
cLTL+ [14], CaTL [13] and STL with integral predicates [15]
are solved in a graph environment using a (mixed) Integer
Linear Program (ILP), where the controls are transitions
between vertices in a graph. In this paper, we consider a
continuous workspace shared by all agents. Each agent has
its own discrete time dynamics with continuous state and
control spaces. We propose a two-step optimization: a global
optimization followed by a gradient-based local optimization
to obtain the controls that maximize CaTL+ robustness.
The contribution of this paper is threefold. First, we
extend CaTL to CaTL+, which is more expressive and works
for continuous environments. Second, we define two kinds
of robustness functions for CaTL+: the traditional and the
exponential robustness. The latter is differentiable almost
everywhere and eliminates masking, which makes it suitable
for gradient-based optimization. This new definition also
includes a novel STL robustness metric. Third, we propose a
two-step optimization strategy for CaTL+ control synthesis.
The resulting controls steer the system to satisfy the given
specification robustly according to the simulation results.
II. SYSTEM MODEL AND NOTATION
Let |S| be the cardinality of a set S. We use bold symbols
to represent trajectories and calligraphic symbols for sets. For
z∈R, we define [z]+= max{0, z}and [z]−=−[−z]+.
Consider a team of agents labelled from a finite set J. Let
Cap denote a finite set of agent capabilities. We assume that
the agents operate in a continuous workspace S ⊆ Rns.
Definition 1. An agent j∈ J is a tuple Aj=
hXj, xj(0), Capj,Uj, fj, ljiwhere: Xj⊆Rnx,j is its state
space; xj(0) ∈ Xjis its initial state; Capj⊆Cap is its
finite set of capabilities; Uj⊆Rnu,j is its control space;
fj:Xj× Uj→ Xjis a differentiable function giving the
discrete time dynamics of agent j:
xj(t+ 1) = fjxj(t), uj(t), t = 0,1, . . . , H −1,(1)
Fig. 1: An earthquake emergency
response scenario. Mis the entire
square workspace. Initaand Initg
are the initial regions for the aerial
and ground vehicles. Cis where
the agents pick up supplies. Bis
a bridge and the 2rectangles R
correspond to the river. V1and V2
are 2affected villages.
where xj(t)and uj(t)are the state and control at time t,
His a finite time horizon determined by the task (detailed
later); lj:Xj→ S is a differentiable function that maps the
state of agent jto a point in the workspace shared by all
agents (this enables heterogeneous state spaces):
sj(t) = ljxj(t), t = 0,1, . . . , H. (2)
The trajectory of an agent j, called an individual trajec-
tory, is a sequence sj=sj(0) . . . sj(H). We assume that
∪j∈J Capj=Cap.
Given a team of agents {Aj}j∈J , the team trajectory
is defined as a set of pairs S={(sj, Capj)}j∈J , which
captures all the individual trajectories with the corresponding
capabilities. Let Jc={j|c∈Capj}be the set of agent
indices with capability c. Let uj=uj(0) . . . uj(H−1) be
the sequence of controls for agent j.
Example. Consider an earthquake emergency response sce-
nario. The workspace S ⊂ R2is shown in Fig. 1. There are
4ground vehicles j∈ {1,2,3,4}and 2aerial vehicles j∈
{5,6}, totaling 6robots indexed from J={1,2,3,4,5,6}
in the workspace. A river Rruns through this area and
a bridge Bgoes across the river. All ground vehicles are
identical. The dynamics fj,j∈ {1,2,3,4}are given by
px,j (t+ 1) = px,j (t) + vj(t) cos θj(t),
py,j (t+ 1) = py,j (t) + vj(t) sin θj(t),
θj(t+ 1) = θj(t) + ωj(t),
(3)
where the state xjis the 2D position and orientation
[px,j py,j θj], the control ujis the forward and angular
speed [vjωj], the state space Xj=Xg⊂R3, the
control space Uj=Ug⊂R2, the initial state xj(0) is a
singleton randomly sampled in region Initgwith randomly
sampled orientation θj∈[1
4π, 3
4π]. The function lj(xj) =
[px,j py,j ] = sjmaps the state of agent jto a position
in the workspace S. The identical capabilities are given by
Capj={“Delivery”,“Ground”},j∈ {1,2,3,4}. All the
aerial vehicles are identical. For j∈ {5,6},fjare given by
px,j (t+ 1) = px,j (t) + vx,j (t),
py,j (t+ 1) = py,j (t) + vy,j (t),(4)
where the state xjis the 2D position [px,j py,j ], the control
ujis the speed [vx,j vy,j ], the state space Xj=Xa⊂R2,
the control space Uj=Ua⊂R2, the initial state xj(0)
is a singleton randomly sampled in the region Inita, the
identity mapping lj(xj) = xj=sjmaps the state of agent
jto a position in S. The identical capabilities are given
by Capj={“Delivery”,“Inspection”},j∈ {5,6}. For