
Statistical and machine learning approaches for prediction of long-time excitation
energy transfer dynamics
Kimara Naicker,1, 2, ∗Ilya Sinayskiy,1, 2 and Francesco Petruccione1, 2, 3
1Quantum Research Group, School of Chemistry and Physics,
University of KwaZulu-Natal, Durban, KwaZulu-Natal, 4001, South Africa
2National Institute for Theoretical and Computational Sciences (NITheCS), South Africa
3School of Data Science and Computational Thinking and Department of Physics,
Stellenbosch University, Stellenbosch, 7600, South Africa
(Dated: October 28, 2022)
One of the approaches used to solve for the dynamics of open quantum systems is the hierarchical
equations of motion (HEOM). Although it is numerically exact, this method requires immense
computational resources to solve. The objective here is to demonstrate whether models such as
SARIMA, CatBoost, Prophet, convolutional and recurrent neural networks are able to bypass this
requirement. We are able to show this successfully by first solving the HEOM to generate a data set
of time series that depict the dissipative dynamics of excitation energy transfer in photosynthetic
systems then, we use this data to test the models ability to predict the long-time dynamics when
only the initial short-time dynamics is given. Our results suggest that the SARIMA model can serve
as a computationally inexpensive yet accurate way to predict long-time dynamics.
I. INTRODUCTION
Time series analysis involves methods of analysing a
series of data points that are indexed in time order. The
objective of the analysis is to collect and study the past
observations of a time series to develop an appropriate
model which describes the inherent structure of the se-
ries. This model is then used to generate future values
for the series, i.e. to make forecasts [1]. In this work, the
data being analysed is relevant to the dissipative dynam-
ics of excitation energy transfer (EET) in systems similar
to the photosynthetic open quantum system regime.
In some cases, information about the underlying dy-
namical correlations in open quantum systems can be
encoded at the initial stages of their evolution. There-
fore, it may be possible to obtain long-time dynamics of
open quantum systems from the knowledge of their short-
time evolution. This conjecture allows the bypass of the
need for direct long-time simulations. The simulation of
numerically exact methods to describe the dynamics of
open quantum systems often require immense computa-
tional resources that scale exponentially with the size of
the system under study, hence, it is desirable to develop
an approach that can accurately predict long-time dy-
namics of open quantum systems along with eliminating
the need for direct calculations to some extent.
Various numerical solutions for the dynamics of open
quantum systems have been developed considering the
complexity of system-bath interactions. The dynamics
of an open quantum system that are dependent on the
Hamiltonian of the system can be described through den-
sity matrix-based approaches in the Liouville space of the
system. The numerically exact formalism adopted in this
study is the hierarchical equations of motion (HEOM) de-
∗kimaranaicker@gmail.com
veloped by Tanimura and Kubo, and later adapted to bi-
ological light harvesting complexes by Ishizaki and Flem-
ing [2–4]. Machine learning (ML) has been applied to this
focus area in many relevant cases [5–10]. L. E. Herrera
et al. conducted a comparative study where they bench-
marked ML models based on their efficiency in predict-
ing long-time dynamics of a two-level quantum system
linearly coupled to harmonic bath [9].
Successful time series forecasting depends on an appro-
priate model fitting. The development of efficient models
to improve forecasting accuracy has evolved in literature.
A comparison of the predictive capabilities of a standard
statistical, an additive regression and a tree-based model
against more structurally complex neural network models
to simulate the open quantum system dynamics is car-
ried out using Python. The first stage of the dynamics
is obtained by solving the HEOM for a sufficiently large
theoretical system, thereafter, we train and test suitable
models to determine the validity of our approach. That
is to predict a time series from that series past values
efficiently.
This paper contains several sections which are orga-
nized as follows: in Sec II we describe the formalism
used, Sec III describes the data pre-processing procedure,
Sec IV covers the various time series models used, Sec V
presents our experimental forecasting results in terms of
MSE obtained on relevant datasets and a brief conclusion
of our work as well as the prospective future aim in this
field.
II. THE THEORY AND THE DATA
A time series is a sequential set of data points measured
over successive times. It is mathematically defined as a
set of vectors x(t), t = 0,1,2, . . . where t represents the
time elapsed [11]. The variable x(t) can be treated as
a random variable. The measurements taken during an
arXiv:2210.14160v2 [quant-ph] 27 Oct 2022