5
the four values of the training parameter. The bifurca-
tion generated by the digital twin is shown in Fig. 2(D),
which agrees remarkably well with the true diagram even
at a detailed level. Note that there are mismatches in the
details such as the positions of some periodic windows
in Figs. 2(C) and 2(D). To predict all the features in a
bifurcation diagram requires extensive interpolation and
extrapolation of the system dynamics in the phase space.
Previously, it was suggested that RC can have a cer-
tain degree of extrapolability [34–39]. Figure 2 repre-
sents an example where the target system’s response is
extrapolated to external sinusoidal driving with unseen
amplitudes. In general, extrapolation is a difficult prob-
lem. Some limitations of the extrapolability with respect
to the external driving signal is discussed in Appendix
A, where the digital twin can predict the crisis point but
cannot extrapolate the asymptotic behavior after the cri-
sis.
In the following, we systematically study the applica-
bility of the digital twin in solving forecasting problems
in more complicated situations than the basic settings
demonstrated in Fig. 2. The issues to be addressed are
high dimensionality, the effect of the waveform of the
driving on forecasting, and the generalizability across
Lorenz-96 networks of different sizes. Results of contin-
ual forecasting and inferring hidden dynamical variables
using only rare updates of the observable are presented
in Appendices C and D, respectively.
B. Digital twins of parallel RC neural networks for
high-dimensional Lorenz-96 climate networks
We extend the methodology of digital twin to high-
dimensional Lorenz-96 climate networks, e.g., m= 20.
To deal with such a high-dimensional target system, if a
single reservoir system is used, the required size of the
neural network in the hidden layer will be too large to
be computationally efficient. We thus turn to the par-
allel configuration [25] that consists of many small-size
RC networks, each “responsible” for a small part of the
target system. For the Lorenz-96 network with m= 20
coupled nodes, our digital twin consists of ten parallel RC
networks, each monitoring and forecasting the dynamical
evolution of two nodes (Dout = 2). Because each node in
the Lorenz-96 network is coupled to three nearby nodes,
we set Din =Dout +Dcouple = 2 + 3 = 5 to ensure that
sufficient information is supplied to each RC network.
The specific parameters of the digital twin are as fol-
lows. The size of the recurrent layer is Dr= 1,200. For
each training value of the forcing amplitude A, the train-
ing and validation lengths are t= 3,500 and t= 100,
respectively. The “warming up” length is t= 20 and the
time step of the dynamical evolution of the digital twin
is ∆t= 0.025. The optimized hyperparameter values
are d= 31, λ= 0.75, kin = 0.16, kc= 0.16, α= 0.33,
β= 1 ×10−12, and σnoise = 10−2.
The periodic signal used to drive the Lorenz-96 cli-
mate network of 20 nodes is f(t) = Asin(ωt) + Fwith
ω= 2, and F= 2. The structure of the digital twin con-
sists of 20 small RC networks as illustrated in Fig. 3(A).
Figures 3(B1) and 3(B2) show a chaotic and a periodic
attractor for A= 1.8 and A= 1.6, respectively, in the
(x1, x2) plane. Training of the digital twin is conducted
by using four time series from four different values of
A, all in the chaotic regime. The attractors generated
by the digital twin for A= 1.8 and A= 1.6 are shown
in Figs. 3(C1) and 3(C2), respectively, which agree well
with the ground truth. Figure 3(D) shows the bifurca-
tion diagram of the target system (the ground truth),
where the four values of A:A= 1.8, 2.2, 2.6, and 3.0,
from which the training chaotic time series are obtained,
are indicated by the four respective vertical dashed lines.
The bifurcation diagram generated by the digital twin is
shown in Fig. 3(E), which agrees well with the ground
truth in Fig. 3(D).
C. Digital twins under external driving with varied
waveform
The external driving signal is an essential ingredient
in our articulation of the digital twin, which is particu-
larly relevant to critical systems of interest such as the
climate systems. In applications, the mathematical form
of the driving signal may change with time. Can a dig-
ital twin produce the correct system behavior under a
driving signal that is different than the one it has “seen”
during the training phase? Note that, in the examples
treated so far, it has been demonstrated that our digital
twin can extrapolate the dynamical behavior of a target
system under a driving signal of the same mathematical
form but with a different amplitude. Here, the task is
more challenging as the form of the driving signal has
changed.
As a concrete example, we consider the Lorenz-96 cli-
mate network of m= 6 nodes, where a digital twin is
trained with a purely sinusoidal signal f(t) = Asin(ωt)+
F, as illustrated in the left column of Fig. 4(A). During
the testing phase, the driving signal has the form of the
sum of two sinusoidal signals with different frequencies:
f(t) = A1sin(ω1t) + A2sin(ω2t+ ∆φ) + F, as illustrated
in the right panel of Fig. 4(A). We set A1= 2, A2= 1,
ω1= 2, ω2= 1, F= 2, and use ∆φas the bifurca-
tion parameter. The RC parameter setting is the same
as that in Fig. 2. The training and validating lengths
for each driving amplitude Avalue are t= 3,000 and
t= 12, respectively. We fine that this setting prevents
the digital twin from generating an accurate bifurcation
diagram, but a small amount of dynamical noise to the
target system can improve the performance of the digital
twin. To demonstrate this, we apply an additive noise
term to the driving signal f(t) in the training phase:
df(t)/dt =ωA cos(ωt) + δDNξ(t), where ξ(t) is a Gaus-
sian white noise of zero mean and unit variance, and δDN
is the noise amplitude (e.g., δDN = 3 ×10−3). We use