
neural networks. In particular, convolutional autoencoders have proved to be able to obtain better
generalization results [Glaws et al., 2020]. However, these approaches only leverage sample quality
metrics while missing the physical properties inherent in the CFD and the benefits of embedding
them at training time. Therefore, motivated by the big successes of convolutional neural networks in
the processing of CFD data [Guo et al., 2016, Tompson et al., 2017] and in-situ data compression
tasks [Liu et al., 2019] and the increased ability to generate large CFD data sets, we develop a
physics-driven convolutional autoencoder compression approach that builds on previous work [Glaws
et al., 2020].
In this work, we show that using an autoencoder model enhanced with two physical properties of
CFD leads to compression models that are more conformant with the physical characteristics of
the data as measured by known metrics for homogeneous isomorphic turbulent flow, including the
divergence-free condition of incompressible flow fields and the preservation of both enstrophy and
dissipation ratio [Constantin and Foias, 2020]. In addition, analyzing the model performance shows
a significant reduction in training time as well as a reduced amount of training data necessary to
achieve same-quality reconstructions as compared to the baseline. Further, our preliminary analysis
of the learned models shows better explicability compared to models trained using only sample
quality metrics. All of this encourage the use of such networks with physical-losses and illustrate that
gradient-based explainability techniques can be leveraged in the future. 1
2 Approach
While lossless data compression approaches can be options for this problem [Fout and Ma, 2012,
Lindstrom and Isenburg, 2006], they pose memory and execution time burdens for the system.
On the other hand, lossy data compression aims to reduce the memory consumption by incurring
some manageable loss of information after decompression. Hence, in our problem of reducing the
dimensionality of CFD data and their memory expense, lossy compression methods are more suitable.
Thus, a general lossy data compression function with full data space
X
and compressed data space
Y
,
can be defined in two parts: the compression step
φ:X → Y
and reconstruction step
ψ:Y → X
where the degree of compression is measured by the compression ratio (CR). To this end, we design
a convolutional autoencoder with compression function
φ
defined by the encoder
E
, data
x
and
embedded data
z
such that
E(x) = z
and decompression function
ψ
by the decoder
D
such that
D(z) = ˆx
. Thus, the compressed data is obtained from
E(x)
which can be stored more easily than
the full data, and the full data reconstruction can be recovered with D(z).
Data
The dataset we use consists of simulated snapshots of fluid velocities from incompressible
decaying isotropic flows with component velocities on the
x
,
y
and
z
dimensions. Thus, the vector
fields are comprised of 3-dimensional meshes of
128 ×128 ×128
datapoints generated by the
spectralDNS package [Mortensen and Langtangen, 2016]. To increase robustness of the network, we
introduce turbulences in the simulations as measured by Taylor-scale Reynolds numbers between (65,
105) and gather a total of 1300 snapshots for our training dataset.
Physics-informed loss
The network follows a fully convolutional architecture. In general, the main
goal of a parameterized autoencoder is to minimize the reconstruction error with some pointwise
metric such as the squared 2-norm:
ΘE,ΘD= argmin
ΘE,ΘD
||x−D(E(x; ΘE); ΘD)||2
2.(1)
Recent works have shown the advantages of using the physical properties of the domain during
training. For instance, in Raissi et al. [2019], the authors train shallow neural networks with losses
that include domain-specific physics laws and show improved generalization performance of the
trained models. Similarly, Cai et al. [2022] show how physics-informed learning improves the
inference performance for CFD domains such as three-dimensional wake flows or supersonic flows.
Thus, in addition to MSE (1), we design the autoencoder loss with two physical laws that are
applicable to the domain in consideration, the divergence-free condition and the preservation of
enstrophy. For the former, due to the incompressibility of the flow field, the density of the CFD
remains constant expressed by
∇ · ~v = 0
and therefore is a property that can be enforced. Similarly,
the enstrophy of a fluid measures the kinetic energy in the flow that corresponds to dissipation
1The code of this work is attached as supplementary material and will be made publicly available.
2