
In this paper, we propose an efficient method for the understanding and ensemble forecast of a general group
of complex turbulent systems accepting coupled multiscale dynamics [23, 24]. We use the ideas in the Random
Batch Method (RBM) originally developed for interacting particle systems [12, 13, 9], and apply it to the very
different problem of multiscale turbulent systems. Inspired by the stochastic gradient descent [34, 1, 11] in machine
learning, the RBM randomly divides and constrains the large number of interacting particles into small batches
in each time interval. The RBM can greatly reduce the computational cost in large particle systems and has got
many successful applications such as on manifold learning [8, 9] and quantum simulations [14]. In the ensemble
prediction of turbulent systems, we focus on the dominant flow structure in the largest scale thus the required
ensemble size to sample a low-dimensional subspace can be controlled. This reduced modeling strategy usually
suffers difficulties in practice since the large scale is closely coupled with all the unresolved small-scale fluctuations,
thus it is impossible to only perform ensemble simulation inside the large-scale subspace. Using random batches,
this difficulty is effectively overcome by regrouping the large number of small-scale fluctuating modes into small
batches each containing only a few modes. Then the batches from a single simulation of the fluctuation modes are
used for updating different ensemble members of the large-scale state separately during a short time step update.
This approximation is based on the important observation that the small-scale fluctuations often decorrelate much
faster in time and contain less energy than the mean-flow state on the largest scale. The batches of different modes
are randomly resampled before each time updating step thus contributions from all scales are well characterized
during the evolution in time. In this way, we achieve an efficient algorithm that gains high skill to capture the fully
non-Gaussian statistical feature in the most important large-scale mean flow state, while greatly reduce the high
computational cost independent of the dimensionality of the full system.
In order to achieve a detailed analysis of the method for the complex turbulent system which usually combines
various complex effects from different sources, we develop the the new RBM algorithm on a unified class of turbulent
systems with emphasis on the explicit coupling between large and small scales, while the unresolved nonlinear
coupling among small scales is modeled by damping and white noise forcing. The simplified formulation reveals the
most important key physical processes on the nonlinear interaction between the large-scale mean flow component
and the smaller scale fluctuation components. On the other hand, the extra complexity due to the nonlinear
self-interactions among the less important small scales are avoided to provide a cleaner model setup. The model
framework is shown to have many representative applications in physical and engineering problems [17, 27, 39, 29,
31]. Precise error estimations are derived based on this general framework for the convergence of the RBM approach
using a finite number of samples. The convergence of the statistical quantifies is proved based on the semigroups
generated by the backward Kolmogorov equations [40] of the RBM model. The RBM is then verified numerically
on two representative prototype turbulent models. Different non-Gaussian statistics are observed in the two models
inferring strong intermittency and extreme events induced from distict physical mechanisms. The numerical tests
show accurate prediction of both transient and equilibrium PDFs recovering various non-Gaussian features under a
much lower computational cost requiring a much smaller ensemble size.
First in the following, we display the general structures that are representative in the turbulent dynamics and
illustrate the central issues in achieving an accurate probability solution.
General formulation for turbulent systems and challenges in efficient statistical fore-
cast
The complex turbulent systems discussed above can be written as the following canonical equation about the state
variable uin a high-dimensional phase space
du
dt= Λu+B(u,u) + F(t) + σ(t)˙
W(t;ω).(1.1)
On the right hand side of the equation (1.1), the first component, Λ = −D+L, represents linear dissipation and
dispersion effects (with a negative-definite dissipation operator D < 0and a skew-symmetric dispersion operator
LT=−Las in [23]). One representative feature of such complex systems is the nonlinear energy conserving
interaction that transports energy across scales. The nonlinear effect is introduced through a bilinear quadratic
form, B(u,u), that satisfies the conservation law u·B(u,u)≡0. External forcing effects are decomposed into a
deterministic component, F(t), and a stochastic component represented by a Gaussian random process, σ˙
W.
The evolution of the model state udepends on the sensitivity to the randomness in initial conditions and
external stochastic effects. Combined with the inherent internal instability due to the nonlinear coupling term in
(1.1), small perturbations are amplified in time thus requiring a probabilistic description to completely characterize
2