Multi-objective and multi-delity Bayesian optimization of laser-plasma acceleration F. Irshad1S. Karsch1 2and A. D opp1 2 1Ludwig-Maximilian-Universit at M unchen Am Coulombwall 1 85748 Garching Germany

2025-05-02 0 0 1.82MB 10 页 10玖币
侵权投诉
Multi-objective and multi-fidelity Bayesian optimization of laser-plasma acceleration
F. Irshad,1S. Karsch,1, 2 and A. D¨opp1, 2
1Ludwig-Maximilian-Universit¨at M¨unchen, Am Coulombwall 1, 85748 Garching, Germany
2Max Planck Institut f¨ur Quantenoptik, Hans-Kopfermann-Strasse 1, Garching 85748, Germany
Beam parameter optimization in accelerators involves multiple, sometimes competing objectives.
Condensing these individual objectives into a single figure of merit unavoidably results in a bias
towards particular outcomes, in absence of prior knowledge often in a non-desired way. Finding an
optimal objective definition then requires operators to iterate over many possible objective weights
and definitions, a process that can take many times longer than the optimization itself. A more
versatile approach is multi-objective optimization, which establishes the trade-off curve or Pareto
front between objectives. Here we present the first results on multi-objective Bayesian optimiza-
tion of a simulated laser-plasma accelerator. We find that multi-objective optimization reaches
comparable performance to its single-objective counterparts while allowing for instant evaluation of
entirely new objectives. This dramatically reduces the time required to find appropriate objective
definitions for new problems. Additionally, our multi-objective, multi-fidelity method reduces the
time required for an optimization run by an order of magnitude. It does so by dynamically choos-
ing simulation resolution and box size, requiring fewer slow and expensive simulations as it learns
about the Pareto-optimal solutions from fast low-resolution runs. The techniques demonstrated in
this paper can easily be translated into many different computational and experimental use cases
beyond accelerator optimization.
I. INTRODUCTION
Laser-plasma interaction [1,2] and in particular its
sub-field of laser-plasma acceleration [3,4] are highly
researched areas with prospects for numerous scientific
and societal applications [5,6]. Until the past decade,
both experimental and numerical investigations in these
fields were often based on single or a few laser shots
and particle-in-cell simulations [7], respectively. Since
then, improvements in laser technology as well as com-
puting hard- and software have made it possible to gather
data for hundreds or thousands of different configura-
tions in both experiments and simulations [810]. This
has sparked interest in using advanced techniques from
computer science, particularly machine learning meth-
ods, which can deal more efficiently with large multi-
dimensional data sets than human operators [11].
Early examples include the use of genetic algorithms
[12,13] and, more recently, the first measurements using
surrogate models have been presented [14,15]. The latter
are intermediate models that are generated based on ex-
isting data during optimization and that can be quickly
explored numerically. Studies involving this Bayesian
optimization have demonstrated clear optimization of a
carefully chosen optimization goal. Importantly, this goal
has to be encoded in form of a so-called objective func-
tion, which acts on the measurement and gives a scalar
output. In the case of a particle accelerator, the beam can
generally be described by the charge distribution ρ(~x, ~p)
in the six-dimensional phase space, and an objective func-
tion that optimizes beam parameters will act on this dis-
tribution or a subset of it. One of the simplest examples
of an objective function is the charge objective function
gQ(ρ(~x, ~p)) = Zρ(~x, ~p)d~xd~p.
While this function can in principle be used as an ob-
jective function in a particle accelerator, it will usually
not yield a useful optimization result. This is because
it optimizes solely the charge and all other beam pa-
rameters such as divergence and energy are lost in the
integration process. In fact, due to energy conservation,
this optimizer tends to reduce the beam energy, which
is an unintended consequence in almost all conceivable
applications of particle accelerators.
In practice, one usually uses a combination of objec-
tives, e.g. reaching a certain charge above a certain en-
ergy or the total beam energy. The design of objective
functions for these problems is even more difficult be-
cause they need to give some constraints or limits to
the single objectives. Many multi-objective scalarizations
take the form of a weighted product g=Qgαi
ior sum
g=Pαigiof the individual objectives giwith the hyper-
parameter αidescribing its weight. For instance, Jalas
et al. [15] optimized the spectrum of a laser-accelerated
beam using an objective function that combines the beam
charge Q, the median energy ˜
Eand the median absolute
deviation ˜
E. Their proposed objective function to be
maximized is Q˜
E/˜
E, i.e. the exponential weights are
α1= 0.5, α2= 1 and α3=1. Here, the use of median-
based metrics will result in less sensitivity to outliers in
the spectrum, while the weight parameter α1= 0.5 ex-
plicitly reduces the relevance of charge compared to beam
energy and spread.
The choice of particular weights is, however, entirely
empirical and usually the result of trial and error. An ob-
jective function is thus not necessarily aligned with the
actual optimization goal, and one often needs to manu-
ally adjust the parameters of the objective function over
multiple optimization runs. In essence, instead of scan-
ning the input parameters of an experiment or simula-
tion, the human operator will be scanning hyperparame-
arXiv:2210.03484v2 [physics.acc-ph] 23 Dec 2022
2
ters of the objective function many times over. The less
prior knowledge about the system is known, the longer
this process may take.
The underlying problem is essentially one of compres-
sion, i.e. that the objective function needs to reduce a
complex distribution function to a single number char-
acterizing said distribution. It is impossible to do this
without information loss for an unknown distribution
function. In fact, even if we knew the distribution, e.g.
a normal distribution, one would still need both mean
and variance to describe it without ambiguities. In the
case of an unknown one-dimensional distribution func-
tion, we can use multiple statistical descriptions to cap-
ture essential features of the distribution such as the cen-
tral tendency (weighted arithmetic or truncated mean,
the median, mode, percentiles, etc.) and the statistical
dispersion of the distribution (full width at half max-
imum, median absolute deviation, standard deviation,
maximum deviation, etc.). These measures weigh dif-
ferent features in the distribution differently. One may
also include higher-order features such as the skewness,
which occurs for instance as a sign of beam loading in en-
ergy spectra of laser-plasma accelerators [8], or coupling
terms between the different parameters. Last, the am-
plitude or integral of the distribution function are often
parameters of interest [16].
In the following, we will discuss optimizations of elec-
tron energy spectra according to different objective def-
initions and then present a more general multi-objective
optimization.
The paper is structured as follows: First, we are going
to discuss details of the simulated laser-plasma accelera-
tor used for our numerical experiments (Section II) and
introduce Bayesian optimization (Section III). Then we
present results from optimization runs using different def-
initions of scalarized objectives that aim for beams with
high charge and low energy spread at a certain target en-
ergy (Section IV). We then compare these results with an
optimization using effective hypervolume optimization of
all objectives (Section V). In Section VI we discuss some
of the physics that the optimizer ’discovers’ during opti-
mization and in the last section, we summarize our results
and outline perspectives for future research (Section VII).
II. LASER-PLASMA ACCELERATOR
As a test system for optimization, we use an example
from the realm of plasma-based acceleration, i.e. a laser
wakefield accelerator with electron injection in a sharp
density downramp [8,17]. The basic scenario here is that
electrons get trapped in a laser-driven plasma wave due
to a local reduction in the plasma density, which is of-
ten realized experimentally as a transition from one side
to the other of a hydrodynamic shock, hence the often-
used name ”shock injection”. The number of electrons
injected at this density transition strongly depends on
the laser parameters at the moment of injection, but also
°1 0 1 2 3
Position z[mm]
0.0
0.5
1.0
1.5
Plasma density [ne]
z0
ldown
lup
ne
FIG. 1. Illustration of the four variable input parame-
ters from Table I, namely the upramp length lup, the down-
ramp length ldown, the plateau density neand the focus po-
sition z0.
on the plasma density itself. Both parameters also affect
the final energy spectrum the electrons exhibit at the end
of the acceleration process. Here we will use simulations
to investigate this system, the primary reason being that
they are perfectly reproducible and do not require addi-
tional handling of jitter, drifts, and noise. However, the
methods outlined in this paper are equally relevant to
experiments. The input space consists of four variable
parameters, namely the plateau plasma density, the po-
sition of laser focus, as well as the lengths of the up- and
downramps of the plasma density close to the density
transition.
While the shock injection scenario is sufficiently com-
plex to require particle-in-cell codes, we use the code
FBPIC by Lehe et al. [18] in conjunction with various
optimizations to achieve an hour-scale run-time. On the
hardware side, the code is optimized to run on NVIDIA
GPUs (here we used Tesla V100 or RTX3090), while the
physical model includes optimizations such as the usage
of a cylindrical geometry with Fourier decomposition in
the angular direction and boosted-frame moving windows
[19]. Additionally, we can take advantage of the very lo-
calized injection to locally increase the macro-particles
density in the injection area [8]. Similarly, the linear
wakefields forming in regions of lower laser intensity re-
sult in a nearly laminar flow of particles, meaning that
we can decrease the macro-particle density far away from
the laser axis [20].
One particular challenge that arises in simulations over
a large range of parameters is that different input param-
eters may result in different computational requirements.
For instance, the transverse box size needs to be several
times larger than the beam waist to assure that the en-
ergy of a focusing beam is not lost. Hence, a laser that is
initialized out of focus requires a larger box size than a
beam initialized in focus. We address this by scaling the
transverse box size lras a function of the laser waist w(z)
at the beginning of the simulation. Similarly, the size of
the wakefield depends on the plasma density, and accord-
ingly, we scale the longitudinal size lzof the box with the
estimated wakefield size. By using these adapted simu-
摘要:

Multi-objectiveandmulti- delityBayesianoptimizationoflaser-plasmaaccelerationF.Irshad,1S.Karsch,1,2andA.Dopp1,21Ludwig-Maximilian-UniversitatMunchen,AmCoulombwall1,85748Garching,Germany2MaxPlanckInstitutfurQuantenoptik,Hans-Kopfermann-Strasse1,Garching85748,GermanyBeamparameteroptimizationinacce...

展开>> 收起<<
Multi-objective and multi-delity Bayesian optimization of laser-plasma acceleration F. Irshad1S. Karsch1 2and A. D opp1 2 1Ludwig-Maximilian-Universit at M unchen Am Coulombwall 1 85748 Garching Germany.pdf

共10页,预览2页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:10 页 大小:1.82MB 格式:PDF 时间:2025-05-02

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 10
客服
关注