
Self-Tuning Tube-based Model Predictive Control
Damianos Tranos, Alessio Russo, and Alexandre Proutiere
Abstract— We present Self-Tuning Tube-based Model Predic-
tive Control (STT-MPC), an adaptive robust control algorithm
for uncertain linear systems with additive disturbances based on
the least-squares estimator and polytopic tubes. Our algorithm
leverages concentration results to bound the system uncertainty
set with prescribed confidence, and guarantees robust con-
straint satisfaction for this set, along with recursive feasibility
and input-to-state stability. Persistence of excitation is ensured
without compromising the algorithm’s asymptotic performance
or increasing its computational complexity. We demonstrate the
performance of our algorithm using numerical experiments.
I. INTRODUCTION
Model Predictive Control (MPC) [1] addresses the infinite
horizon optimal control problem in the presence of input
and state constraints by approximating it as a sequence of
finite horizon optimization problems. When the dynamics of
the system are uncertain, robust MPC methods [2] can be
employed to ensure constraint satisfaction for pre-specified
sets of system parameters and disturbances. This robustness
comes at the cost of a reduced closed-loop performance. To
mitigate this performance loss, one may leverage adaptive
control techniques [3] to learn the system dynamics in an
online manner, and in turn reduce the uncertainty causing
this loss.
Adaptive control schemes have been developed and stud-
ied mainly for unconstrained control problems. The first
results [4] concerned their asymptotic convergence properties
and established conditions under which the controller derived
from these schemes actually approaches the optimal feedback
controller obtained by assuming the full knowledge on the
system dynamics. More recently, see e.g. [5]–[7], researchers
managed to quantify the convergence rate of some classical
adaptive control schemes, such the celebrated self-tuning
regulators, as well as the price that one has to pay (in
terms of cumulative losses – often captured through the
notion of regret) to learn the system dynamics. These recent
important results are however restricted to the LQR problems
in unconstrained linear systems.
In this paper, we investigate the design and the perfor-
mance analysis of MPC schemes handling system dynamics
uncertainties. We propose to combine adaptive and robust
control methods. The adaptive control component of our
schemes allows to rapidly reduce over time the uncertainties
This work was supported by the Wallenberg AI, Autonomous Systems
and Software Program (WASP) funded by the Knut and Alice Wallenberg
Foundation.
D. Tranos, A. Russo, and A. Proutiere are with the Division of Decision
and Control Systems, School of Electrical Engineering and Computer
Science, Royal Institute of Technology (KTH), Stockholm, Sweden. Emails:
{tranos@kth.se, alessior@kth.se, alepro@kth.se }.
in a controlled and quantifiable manner, whereas the robust
control component ensures constraint satisfaction. More pre-
cisely, our contributions are as follows.
Contributions. We address the problem of controlling an
uncertain linear system with parametric and additive distur-
bances, subject to deterministic constraints. We present Self-
Tuning Tube-based Model Predictive Control (STT-MPC), an
algorithm combining adaptive and robust control techniques.
STT-MPC uses a simple Least Squares Estimator (LSE) for
estimating the dynamics and a parameter set compatible with
the observations with prescribed level of certainty. We derive
tight concentration results (similar to those presented in [7]
for the LQR problem) for this set, and exploit these results
to construct fixed complexity polyhedral approximations of
it. In STT-MPC, these approximations are used to build a
polytopic tube MPC scheme [8] to ensure robust constraint
satisfaction. We establish the recursive feasibility and the
input-to-state stability of the proposed scheme.
In contrast to previously proposed robust MPC schemes
(refer to §II for details), STT-MPC enjoys the following
properties. (i) STT-MPC uses a probabilistic rather than ro-
bust estimation scheme, leading to notably faster adaptation
rates and smaller parameter sets for which the constraints
need to be satisfied. (ii) Persistence of Excitation (PE),
required to get performance guarantees for the LSE, is
achieved without modifying either the cost or objective
function of the MPC. The added excitation is treated as an
additive disturbance and can be chosen to decay to zero with
time (e.g., at a rate 1/√t). This allows us to asymptotically
recover the performance of the standard MPC algorithm
with full knowledge of the dynamics. (iii) An input-to-state
stability analysis of STT-MPC is possible even when the
PE condition is active and without imposing any additional
restrictions on the choice of the estimate of the nominal
model parameter.
Notations. For a time dependent vector xt, we denote by
xk|tits prediction at time k+tgiven information at time
t. For any two sets Aand B, we define their Minkowski
sum as the set A ⊕ B := {a+b:a∈ A, b ∈ B}. We
also define, for any constant λ≥0, the scaled set λA:=
{λa, a ∈ A}.For any d∈N,x∈Rd, and > 0let B(x, )
denote the ball of the spectral norm centered on x. The
unit ball centered at the origin is denoted simply as B. For
any set S, and any ε > 0there exists a polytope Pthat is
an outer approximation of S, i.e. S ⊂ P ⊕ εB. We refer
to this polytope as the outer polyhedral approximation of S.
Finally, a function κ:R+→R+is of class Kif it is strictly
increasing and κ(0) = 0 and is of class K∞if in addition
arXiv:2210.00502v1 [eess.SY] 2 Oct 2022