Kang R. et al.
instead of a data-driven one, but using a network to guide the direction of state search [
31
], as often done in meta
learning [
32
,
33
]. This kind of method is still in the scope of optimization method, If the physical model embedded is
believed to be trustful, the final state estimation can thus be considered to be reliable. However, the network needs to be
pretrained on the designated physical model, which means that the mismatch between the network and the physical
model during testing could lead to incorrect search direction, getting stuck in local optima, and eventually cannot
converge.
In summary, inverse function methods allow for efficient estimations, while optimization methods, coupled with physical
forward models, provide for effective estimations. However, all these ML-based methods require pre-training, which in
turn translates to requirements for preparation time and resources prior to model deployment, and more important, the
concerns in regard to the generalization performance of the model on the testing dataset.
In this work, we propose a general purpose, neural networks-based framework for the solution of inverse problems,
termed Self-Validated Physics-Embedding Network (SVPEN). The principle of SVPEN is to couple both the inverse
function and optimization methods, through embedding physical forward models into neural networks. By utilizing
physical forward model to validate the quality of estimated state, the system ensures that final state estimations are
physically reasonable, while the two problem solving modes of inverse function and optimization lend SVPEN with
their advantages of efficient and effective state estimations. Moreover, SVPEN can be deployed without the use of
pre-collected dataset and pre-training, while its structure can be adapted to update the underlying physical/ML models.
In order to demonstrate the advantages of SVPEN, this contribution considers two diverse inverse problems as we
introduced above, namely, retrieval of temperature and gas concentration from MAC, and aeroengine cycle analysis from
performance requirements (code repo:
https://github.com/RalphKang/SVPEN_1.0
). The results demonstrate the
high efficiency, effectiveness, flexibility and adaptivity of the proposed framework.
2 SVPEN
The skeleton of Self-Validated Physics-Embedding Network (SVPEN) is demonstrated in Fig. 2. In general, SVPEN is
constituted of inverse function and optimization modes. Once an observation is fed to SVPEN, the inverse function
mode is first used to give a quick estimation of the state; meanwhile, an estimation error which reflects the quality of
state estimation is also provided. The estimated state is accepted when the estimation error is smaller than an error
threshold
ε
, otherwise, the system automatically switches to the optimization mode. In the optimization mode, the
estimation error is gradually reduced to be smaller than the error threshold
ε
by performing gradient descent on the
estimated state. The control variable, i.e., estimation error, is determined by the physical forward model embedded
in both modes, thereby, assuring the acceptable estimated state is physically reasonable. In this section, we introduce
in detail the design of SVPEN, while in section 2.1, we present the design of inverse function mode. In section 2.2,
attention shifts to the introduction of the optimization mode, and finally in section 2.3, we explain the cooperation of
these two modes within the structure of SVPEN, and the associated benefits brought about.
2.1 Inverse function mode
The structure of the inverse function mode is schematically shown in Fig. 3, which is mainly constituted of three
computational components, i.e., state estimator
G1
, physical forward model
F
, and error calculation component
E
.
The state estimator is a ML model, which takes the observation
y
as the input, and transforms it to an estimated state
ˆx
.
Then the acquired
ˆx
is fed to
F
to generate an estimated observation
ˆy
. The difference between estimated and given
observations, and even the prior requirement of state can be used to calculate an error
e
, which assess the quality of
ˆx
.
The whole process can be expressed into Eqs. 1- 3.
ˆx=G1(y)(1)
ˆy=F(ˆx)(2)
e=E(y, ˆy, ˆx)(3)
According to the functionalities of physical forward model and error calculation component, they can be wrapped
together into an integrated module termed physical evaluation module, which is used to assess the state estimation
quality from a physical perspective.
2.1.1 State estimator
One can recognize that the functionality of the state estimator is not different to those of ML models in the inverse
function method [
7
,
8
,
9
,
20
,
25
,
34
], i.e., it is expected to transform observations into accurate estimations of states. In
3