1 Motivation
“What can your Neural Network tell you about the underlying physics?” is the most common question
when we apply Neural Networks to study the behavior of materials and “Nothing.” is the honest
and disappointing answer.
This manuscript challenges the notion that Neural Networks can teach us nothing about the
physics of a material. It seeks to integrate more than a century of knowledge in continuum me-
chanics [3, 4, 22, 38, 40, 47, 50, 51] and modern machine learning [24, 29, 41] to create a new family
of Constitutive Artificial Neural Networks that inherently satisfy kinematical, thermodynamical,
and physical constraints, and constrain the space of admissible functions to train robustly, even
when data are space. While this general idea is by no means new and builds on several important
recent discoveries [2,27, 28, 32], the true novelty of our Constitutive Artificial Neural Networks is
that they autonomously discover a constitutive model, and, at the same time, learn a set of physically
meaningful parameters associated with it.
Interestingly, the first Neural Network for constitutive modeling approximates the incremental
principal strains in concrete from known principal strains, stresses, and stress increments and is
more than three decades old [17]. In the early days, Neural Networks served merely as regres-
sion operators and were commonly viewed as a black box. The lack of transparency is probably
the main reason why these early approaches never really generated momentum in the constitu-
tive modeling community. More than 20 years later, data-driven constitutive modeling gained
new traction, in part powered by a new computing paradigm, which directly uses experimental
data and bypasses constitutive modeling altogether [26]. While data-driven elasticity builds on
a transparent and rigorous mathematical foundation [9], it can also become fairly complex, espe-
cially when expanding the theory to anisotropic [13] or history-dependent [14] materials. Rather
than following this path and eliminate the constitutive model entirely, here we attempt to build
our prior physical knowledge into the Neural Network and learn something about the constitu-
tive response [1].
Two successful but fundamentally different strategies have emerged to integrate physical knowl-
edge into network modeling, Physics-Informed Neural Networks that add physics equations as ad-
ditional terms to the loss function [24] and Constitutive Artificial Neural Networks that explicitly
modify the network input, output, and architecture to hardwire physical constraints into the net-
work design [28]. The former approach is more general and typically works well for incorporating
ordinary [29] or partial [41] differential equations, while the latter is specifically tailored towards
constitutive equations [30]. In fact, one such Neural Network, with strain invariants as input,
free energy functions as output, and a single hidden layer with logistic activation functions in be-
tween, has been proposed for rubber materials almost two decades ago [46] and recently regained
attention in the constitutive modeling community [55]. While these Constitutive Artificial Neural
Networks generally provide excellent fits to experimental data [6,36,52], exactly how they should
integrate thermodynamic constraints remains a question of ongoing debate.
Thermodynamics-based Artificial Neural Networks a priori build the first and second law of ther-
modynamics into the network architecture and select specific activation functions to ensure com-
pliance with thermodynamic constraints [32]. Recent studies suggest that this approach can suc-
cessfully reproduce the constitutive behavior of rubber-like materials [18]. Alternative approaches
use a regular Artificial Neural Network and ensure thermodynamic consistency a posteriori via a
pseudo-potential based correction in a post processing step [25]. To demonstrate the versatility of
these different approaches, several recent studies have successfully embedded Neural Networks
within a Finite Element Analysis, for example, to model plane rubber sheets [28] or entire tires [46],
the numerical homogenization of discrete lattice structures [33], the deployment of parachutes [2],
2