
Towards Structural Reconstruction from X-Ray Spectra
Anton Vladyka,1, ∗Christoph J. Sahle,2, †and Johannes Niskanen1, ‡
1University of Turku, Department of Physics and Astronomy, 20014 Turun yliopisto, Finland
2European Synchrotron Radiation Source, 71 Avenue des Martyrs, 38000 Grenoble, France
(Dated: February 17, 2023)
We report a statistical analysis of Ge K-edge X-ray emission spectra simulated for amorphous
GeO2at elevated pressures. We find that employing machine learning approaches we can reliably
predict the statistical moments of the Kβ00 and Kβ2peaks in the spectrum from the Coulomb ma-
trix descriptor with a training set of ∼104samples. Spectral-significance-guided dimensionality
reduction techniques allow us to construct an approximate inverse mapping from spectral moments
to pseudo-Coulomb matrices. When applying this to the moments of the ensemble-mean spectrum,
we obtain distances from the active site that match closely to those of the ensemble mean and which
moreover reproduce the pressure-induced coordination change in amorphous GeO2. With this ap-
proach utilizing emulator-based component analysis, we are able to filter out the artificially complete
structural information available from simulated snapshots, and quantitatively analyse structural
changes that can be inferred from the changes in the Kβemission spectrum alone.
I. INTRODUCTION
Core-level spectroscopy provides information of struc-
ture of matter at the atomic level, and the constituent
methods are applied from standard material character-
ization to conceptually new experiments at large-scale
facilities such as free-electron lasers. Although refer-
ence data helps, interpretation of core-level spectra is
not always straightforward, especially in the case of soft
condensed or amorphous matter where ensemble statis-
tics plays a drastic role [1–7]. Studies of this statistical
nature, and the implied repeated function evaluations,
could benefit from machine learning (ML), application
of which to core-level spectra has been studied rather
intensively lately [8–15]. In general, when working with
atomic resolution studies have raised the need to engineer
features for both structure [16–20] and spectra [13, 19].
The pressure dependent evolution of the germanium
coordination by oxygen in glassy GeO2has been a long
standing subject of study [21–24]. Besides applications of
amorphous GeO2in technical glasses, the increased sen-
sitivity of a-GeO2to pressure compared to amorphous
SiO2motivates the study of structural changes similar
to those expected to occur in the pressurized analogue
glass a-SiO2but at greatly reduced absolute pressures.
Detailed knowledge of the compaction mechanisms in
these simple glasses will have direct consequences for our
understanding of geological, geochemical, and geophysi-
cal processes involving more complex silicate glasses and
melts.
X-ray emission spectra (XES) of GeO2is an inviting
case for development of spectroscopic analysis for soft
and amorphous condensed matter. First, large spectro-
scopic changes with changing local structure are known
∗anton.vladyka@utu.fi
†christoph.sahle@esrf.fr
‡johannes.niskanen@utu.fi
to exist [24]. Second, simulations are known to repro-
duce the observed ensemble-mean effects well[25]. Third,
XES is local-occupied-orbital derived and a few orbital-
bonding neighbor atoms are expected to be decisive for
the spectrum outcome. This would result in a mini-
mal set of structural parameters needed to predict XES.
Last, owing to the chemical simplicity and simple bond-
ing topology due to non-molecular structure, this system
has promise to be reproduced by ML with the limited
number of data points that the condensed phase allows.
Namely, for such systems the electronic simulation needs
to account for multi-electron effects in numerous interact-
ing atoms – typically on the level of density functional
theory. As a consequence, the number of individual struc-
tural data points for spectroscopy can be expected to be
∼104in an extensive contemporary simulation.
In this work, we focus on Ge KβXES calculations
of amorphous GeO2at elevated pressures. Our previ-
ous work on the water molecule indicated that predicting
spectral features is easier than predicting structural fea-
tures [14]. In the condensed phase, where the structural
features to be predicted are more numerous, the task is
arguably even more complicated. As a solution to this
dilemma, we build a procedure on spectrum prediction
for structures, dimensionality reduction and iterative op-
timization algorithms. This approach is possible because
the evaluation of an ML model requires much less compu-
tational resources than the corresponding quantum me-
chanical calculation does. We predict statistical moments
of XES lines from a Coulomb matrix[16] that describes
the local atomic structure around the site of characteris-
tic X-ray emission. Next, we study obtainable structural
information for the occurring spectral changes in the
pressure progression of the XES by emulator-based com-
ponent analysis (ECA)[15]. Last, we investigate an ap-
proximate solution to the spectrum-to-structure inverse
problem by first transforming it into an optimization task
in the dimension-reduced ECA space, followed by expan-
sion to the full multi-dimensional Coulomb matrix. A
dedicated evaluation data set allows for assessment of
arXiv:2210.13909v2 [cond-mat.dis-nn] 16 Feb 2023