
Optimal Control for Platooning in Vehicular Networks A PREPRINT
1.1 Related Work
Most of the research efforts so far have been concerned with studying the aerodynamic aspects of platooning Zabat
(1995); Vohra et al. (2018), the cooperative longitudinal control of trucks Tsugawa et al. (2016); Hu et al. (2020);
Milanes et al. (2014), and the stability of the platoons from a network perspective Wang et al. (2021); Petrillo et al.
(2021). However, the system aspects (e.g., platoon coordination) are still not well explored Adler et al. (2020), especially
concerning optimal control.
Previous works on platooning coordination considered the dispatching control of trucks waiting in a station/hub Adler
et al. (2020); Zhang et al. (2017); A. Johansson et al. (2020). In these works, trucks arrive at the station following a
random process (e.g., Poisson or Bernoulli), and the station decides whether they should wait to form platoons. If the
station holds them, it may build platoons with many trucks, which reduces fuel consumption. However, forcing many
trucks to wait at the station incurs high transportation delay cost. These works investigated the optimal dispatching
control at the station that minimizes an average cost function.
In Zhang et al. (2017), the authors studied optimal platoon coordination at a highway junction (hub). Their model
consisted of two trucks arriving at the hub with stochastic arrival times. If they arrive at the same time, they form a
platoon. One truck may have to wait for the other when their arrival times differ, which incurs a waiting cost. The
authors proved that it is optimal to build a platoon only when the arrival time of each truck differs by less than a
threshold.
In Adler et al. (2020), the authors studied platooning coordination of multiple trucks at a station, where the arrivals
are Poisson distributed. The station decides whether to hold trucks in order to build platoons. The authors compared
different truck dispatching policies governing the station under energy-delay tradeoff considerations. They proved the
optimality of threshold policies to control station dispatching, where the station dispatches all trucks whenever the
number of waiting trucks in the station grows above the threshold. In A. Johansson et al. (2020), the authors proposed a
similar model to the one in Adler et al. (2020) under the assumption that arrivals of trucks are i.i.d and their distribution
is known by the station. They proved that the optimal policy is of the type "one time-step look-ahead".
1.2 Model Novelty and Main Contributions
In this paper, we study the dispatching and formation of platoons from a novel perspective. In particular, like Adler
et al. (2020), A. Johansson et al. (2020) (and unlike Zhang et al. (2017)), in our model we consider a waiting station
that can hold multiple trucks. However, unlike Adler et al. (2020) and A. Johansson et al. (2020), we assume that trucks
arrive at the station, while platoons arrive alongside the station. Therefore, we dispatch trucks with arriving platoons, as
opposed to forming platoons among waiting trucks. Our cost function is also different.
We formalize the dispatching actions at the station as an optimal control problem. We then use dynamic programming
to analyze it. The main contributions of this work are as follows:
•
We derive the optimal dispatching policy governing the system under finite and infinite horizon discounted
cost criteria. We show that the policies are of threshold type with a finite threshold.
• We use Lippman’s Lippman (1973) results to derive the optimal policy for the average cost criterion.
• We present numerical results for the average cost case.
The paper is organized as follows. In Section 2, we formulate the dispatching trucks to arriving platoons problem. We
also present the Markov Decision Problem. We characterize the optimal control policy for the discounted cost (with
finite and infinite horizons) in Section 3. In Section 4, we derive the optimal policy for the average cost problem. We
present some numerical results for the average cost problem in Section 5. Our conclusion and suggestions for future
work are presented in Section 6.
2 Model and Control Problem Formulation
We consider the platooning system shown in Figure 1.
Define the station as a location where trucks arrive and can wait to join platoons. Platoons arrive alongside the station.
The station decides how trucks are dispatched from the station. We assume that only one truck at a time can be
dispatched.
The system operates in discrete time. Truck arrivals are Bernoulli with parameter p. Trucks join the same queue upon
arrival. Platoon arrivals are Bernoulli with parameter
q
. In order to avoid trivialities, we assume that
0<p<1
and
0< q < 1.
2