Model Reference Gaussian Process Regression:
Data-Driven Output Feedback Controller
Hyuntae Kim, Hamin Chang, and Hyungbo Shim
Abstract— Data-driven controls using Gaussian process re-
gression have recently gained much attention. In such ap-
proaches, system identification by Gaussian process regression
is mostly followed by model-based controller designs. However,
the outcomes of Gaussian process regression are often too
complicated to apply conventional control designs, which makes
the numerical design such as model predictive control employed
in many cases. To overcome the restriction, our idea is to
perform Gaussian process regression to the inverse of the
plant with the same input/output data for the conventional
regression. With the inverse, one can design a model reference
controller without resorting to numerical control methods. This
paper considers single-input single-output (SISO) discrete-time
nonlinear systems of minimum phase with relative degree one.
It is highlighted that the model reference Gaussian process
regression controller is designed directly from pre-collected
input/output data without system identification.
I. INTRODUCTION
Gaussian process regression (GPR) [1], one of the most
well-known regression tools for nonlinear functions, has
been extensively used in various fields by virtue of the
following properties [2]. First, since it is a nonparametric
method, it has some flexibility to deal with a large amount
of data. Secondly, prior knowledge of the regression target
can easily be incorporated. Finally, it gives some confidence
information about the regression result, which can be utilized
to measure the regression error.
Particularly in control systems, GPR has been mainly
applied for identifying unknown nonlinear systems using
input/output or even state data before designing a model-
based controller for the identified model. For instance, [3]
and [4] show that a model predictive controller can be
designed based on the model identified by GPR. Moreover,
its real world applications are presented in [5] and [6]
for quadrotors and mobile robots, respectively. In addition,
combining the prior knowledge of a nominal model, [7] and
[8] demonstrate the utility of such Gaussian process-based
model predictive control (GP-MPC) method in autonomous
racing systems by identifying a residual model instead of the
full dynamical system. Also, [9] presents real world exper-
iments of quadrotors controlled by the GP-MPC approach,
where only aerodynamic effects on quadrotors are modeled
by Gaussian process. On the other hand, [10] and [11]
propose a feedback linearization controller for the system,
which is identified by GPR. They also provide Lyapunov
This work was supported by the grant from Hyundai Motor Company’s
R&D Division.
All authors are with ASRI, Department of Electrical and Computer
Engineering, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul,
08826, Korea. Corresponding author: hshim@snu.ac.kr
stability analysis of the controlled system based on the result
concerning the error of the identification in [12]. On the other
hand, [13] and [14] propose event triggered online learning
of GPR in order to increase data efficiency.
However, most of these studies focus only on the system
identification capability of the GPR in the sense that con-
trollers should be designed only after the system identifica-
tion by using the GPR is completed. This leads to a problem
with controller design because even if the given system has
a relatively simple analytic formula, its identified model by
the GPR can be too complicated consisting of a summation
of as many terms as the number of data points. This
complexity has often restricted applicable control methods,
so that numerical methods such as MPC (model predictive
control) are typically employed for the identified model.
Since numerical control methods require a certain amount
of online computation resource, its utility can sometimes be
limited.
To overcome the restriction, we propose identification of
the inverse of the given system by GPR, with the purpose
of using it for model reference control. In this way, we can
bypass numerical controls and can combine classical controls
resulting in a data-driven controller, which we call model
reference Gaussian process regression (MR-GPR) control
in this paper. Since it is natural to assume that we have
access to only input/output measurements of the plant, we
propose the MR-GPR controller in the form of an output
feedback control. Therefore, the GPR is performed only with
input/output data of the system. Since our approach is based
on input/output inversion in some sense, a few limitations
naturally follow such as causality and minimum phase issues.
In this paper, we assume that the system has relative degree
one to resolve the causality issue, which is not very restrictive
because a sampled-data system of a continuous-time system
generically has relative degree one. Moreover, we assume
that the system is of minimum phase.
This paper is organized as follows. The problem formu-
lation with a class of nonlinear systems under consideration
and a couple of assumptions on the class of systems are in
Section II. In Section III, we propose the data-driven MR-
GPR controller and explain how to design it using the GPR.
Also, a stability analysis of the closed-loop system with the
MR-GPR controller is presented. An illustrative example that
demonstrates the usefulness of the MR-GPR controller is
given in Section IV. Finally, this paper is summarized and
concluded in Section V.
Notation: For column vectors aand b,[a;b]denotes
[aT, bT]T. For discrete-time vector sequences y(t)and z(t),
1
arXiv:2210.02494v1 [eess.SY] 5 Oct 2022