Optimal Control of Material Micro-Structures
Aayushman Sharma, Zirui Mao, Haiying Yang, Suman Chakravorty, Michael J Demkowicz, Dileep Kalathil
Abstract— In this paper, we consider the optimal control
of material micro-structures. Such material micro-structures
are modeled by the so-called phase field model. We study
the underlying physical structure of the model and propose
a data based approach for its optimal control, along with a
comparison to the control using a state of the art Reinforcement
Learning (RL) algorithm. Simulation results show the feasibility
of optimally controlling such micro-structures to attain desired
material properties and complex target micro-structures.
I. INTRODUCTION
In this paper, we consider the optimal control of complex
and high DoF (Degree of Freedom) material micro-structure
dynamics, by doing which we aim at developing a robust
tool for exploring the fundamental limits of materials micro-
structure control during processing. The micro-structure
dynamics process is governed by the so-called Phase Field
(PF) model [1], which is a powerful tool for modeling
micro-structure evolution of materials. It represents the
spatial distribution of physical quantities of interest by an
order parameter field governed by one or a set of partial
differential equations (PDEs). The Phase Field method
can naturally handle material systems with complicated
interface, thanks to which it has been successfully applied to
the simulation of a wide range of materials micro-structure
evolution processes, such as e.g., alloy solidification [2][3],
alloy phase transformation [4][5], grain growth[6][7], and
failure mechanism of materials [8][9] etc.
The phase field model is complex, nonlinear, and high
DOF, and does not admit analytical solutions. Thus, data
based approaches are natural to consider for the control of
these systems. In recent work [10], we proposed a novel
decoupled data based control (D2C) algorithm for learning
to control an unknown nonlinear dynamical system. Our
approach introduced a rigorous decoupling of the open-loop
(planning) problem from the closed-loop (feedback control)
problem. This decoupling allows us to come up with a highly
efficient approach to solve the problem in a completely data
based fashion. Our approach proceeds in two steps: (i) first,
we optimize the nominal open-loop trajectory of the system
using a ‘blackbox simulation’ model, (ii) then we identify
the linear system governing perturbations from the nominal
trajectory using random input-output perturbation data,
and design an LQR controller for this linearized system.
We have shown that the performance of D2C algorithm
is approximately optimal, in the sense that the decoupled
design is near optimal to second order in a suitably defined
noise parameter [10]. In this work, we apply the D2C
approach to the optimal control of the evolution of material
micro-structures governed by the phase field model. For
comparison, we consider RL techniques [11]. The past
several years have seen significant progress in deep neural
networks based reinforcement learning approaches for
controlling unknown dynamical systems, with applications
in many areas like playing games [12], locomotion [13]
and robotic hand manipulation [14]. A number of new
algorithms that show promising performance are proposed
[15][16][17] and various improvements and innovations have
been continuously developed. However, despite excellent
performance on a number of tasks, RL is highly data
intensive. The training time for such algorithms is typically
very large, and high variance and reproducibility issues mar
the performance [18]. Thus, materials microstructure control
is intractable for current RL techniques. Recent work in
RL has also focused on PDE control using techniques
like the Deep Deterministic Policy Gradient(DDPG) with
modifications such as action descriptors to limit the large
action spaces in infinite dimensional systems [19], as such
algorithms tend to degrade in performance with an increase
in dimensionality of the action-space. However, such an
approach, in general, is not feasible for the Microstructure
problem.
The control design of systems governed by PDEs are
known to be computationally intractable [20]. Owing to
the infinite dimensionality of the system, typically model
reduction techniques are used to reduce the size of the
state space. The most widely used method is the Proper
Orthogonal Decomposition (POD) method that finds a
reduced basis for the approximation of the nonlinear PDE
using a simulation of the same [21]. However, in general,
the basis eigenfunctions can be quite different around the
optimal trajectory when compared to an initial trajectory,
and an approximation using the latter’s reduced basis can
lead to unacceptable performance [22],[23]. In our approach,
the nominal optimization problem is directly solved which
does not need any reduced order modeling. Furthermore,
the feedback design is accomplished by identifying a linear
time-varying (LTV) model which automatically produces
a reduced order model (ROM) of the system based purely
on the input output data while facilitating the use of
linear control design techniques such as LQG/ILQR on the
problem as opposed to having to solve a nonlinear control
problem in the methods stated previously. Furthermore, the
entire control design for the problem is done purely in a
data based fashion since the technique only queries a black
box simulation model of the process.
arXiv:2210.06734v1 [eess.SY] 13 Oct 2022