yields a linear estimator for the underlying dynamics. Alternative approaches have been developed
to identify fully nonlinear representations for the underlying equations from data [8], [9], [10],
[11]. These methods typically consider a large nonlinear function library and subsequently use
machine learning algorithms to choose a sparse subset that best matches the training data. While
such methods can be readily applied to identify sparse representations of highly nonlinear (and
even chaotic systems), their efficacy is dependent on the choice an appropriate nonlinear function
library. Related machine learning approaches using neural networks have also achieved success for
prediction in highly nonlinear dynamical systems [12], [13], [14].
Many of the data-driven model identification strategies described above have a close connection
to Koopman analysis [15], [16], [17]. Koopman-based approaches generally allow for the repre-
sentation of a nonlinear dynamical system as a linear operator acting on an infinite-dimensional
observable space. Such approaches are distinct from standard linearization techniques that consider
the dynamics in a close neighborhood of some nominal solution. Rather, the goal of Koopman anal-
ysis is to identify a linear operator that can accurately capture fundamentally nonlinear dynamics –
the key challenge is in the identification of a suitable finite basis to represent the action of the gen-
erally infinite dimensional Koopman operator. The connection between DMD and spectral analysis
of the Koopman operator is well established [4], and in applications where high-dimensional data
is readily available, the DMD algorithm can indeed be used to provide a finite-dimensional approx-
imation of the Koopman operator. Extensions of the DMD algorithm have illustrated that more
accurate approximations of the Koopman operator can be obtained using DMD in conjunction with
a set of lifting functions [18] and/or time-delayed embeddings of snapshot data [19]. Additional
accuracy can also be obtained using an adaptive strategy that uses DMD to obtain a continuous
family of linear models and actively chooses the one that provides the best representation at every
instant [20].
How best to approximate the action of the Koopman operator from snapshot data remains an
open question. DMD separates data into snapshot pairs and subsequently finds a linear operator
that provides a least squares fit for the mapping from one snapshot to the next. This is currently the
most widely used approach. An obvious advantage of linear estimators of the Koopman operator
is that they allow for subsequent analysis using a wide variety of linear techniques. Nonetheless,
such linear estimators are not always suitable for highly nonlinear systems since finite dimensional
linear operators cannot be used, for instance, to replicate the infinite-time behavior of systems with
multiple hyperbolic fixed points or systems with stable limit cycles. Further limitations of linear
estimators can also be seen in [25] which established the difficulty of representing even the relatively
simple Burgers’ equation in terms of a linear operator owing to the existence highly degenerate
Koopman eigenvalues. Alternatively, nonlinear models obtained from data-driven techniques are
often more difficult to analyze, but can admit lower dimensional realizations and can often provide
accurate representations of chaotic behavior [9], [12], [13]. Recent works have considered nonlinear
estimation strategies. For instance, [21] approximates separate Koopman operators that result
for different values of an applied control input and uses this information to formulate a switching
time optimization problem. Related approaches consider bilinear approximations of the Koopman
operator for control systems [22], [23], [24].
This work explores strategies for nonlinear data-driven estimation of the action of the Koopman
operator. General strategies that yield nonlinear models are presented for systems both with and
without control. In the various examples considered in this work, only short term predictions of
the dynamical behavior can be obtained using linear estimators for the Koopman operator. By
contrast, nonlinear estimators are able to provide accurate long-term estimates for the dynamics
of model observables and yield accurate information about limit cycling behaviors and basin of
attraction estimates. The organization of this paper is as follows: Section 2 provides necessary
2