
ANOPTIMIZATION-BASED SUPERVISED LEARNING
ALGORITHM FOR PXRD PHASE FRACTION ESTIMATION
A PREPRINT
Patrick Hosein
Department of Computer Science
The University of the West Indies
St. Augustine, Trinidad
patrick.hosein@sta.uwi.edu
Jaimie Greasley
Department of Physics
The University of the West Indies
St. Augustine, Trinidad
jaimie.greasley@gmail.com
October 21, 2022
ABSTRACT
In powder diffraction data analysis, phase identification is the process of determining the crystalline
phases in a sample using its characteristic Bragg peaks. For multiphasic spectra, we must also de-
termine the relative weight fraction of each phase in the sample. Machine Learning algorithms (e.g.,
Artificial Neural Networks) have been applied to perform such difficult tasks in powder diffraction
analysis, but typically require a significant number of training samples for acceptable performance.
We have developed an approach that performs well even with a small number of training samples.
We apply a fixed-point iteration algorithm on the labelled training samples to estimate monophasic
spectra. Then, given an unknown sample spectrum, we again use a fixed-point iteration algorithm
to determine the weighted combination of monophase spectra that best approximates the unknown
sample spectrum. These weights are the desired phase fractions for the sample. We compare our
approach with several traditional Machine Learning algorithms.
Keywords machine learning, x-ray diffraction, phase identification, quantitative phase analysis
Main
The assessment of powder X-ray diffraction (PXRD) spectra is central to many materials investigations. X-ray scatter-
ing data reveals important structural and micro-structural parameters for characterizing a material[7]. The collection
of scattered intensities attributed to labelled crystallographic planes, in fact serves as a fingerprint reference for the
material structure[15]. Phase identification is performed by matching observed Bragg peaks to a powder pattern ref-
erence in a database, usually with the aid of a search-match program. Finding the corresponding reference may not
be easy as different instrument settings or any slight deviation in structure cause variant diffraction profiles for any
given phase. For multi-phase analysis, phase fraction estimation is possible by presuming some relationship between
the observed intensities for each phase in the spectrum. Several method varying in complexity, computational rigor
and sample preparation requirements, are available [20]. These include the use of internal standard calibrations as in
the Reference Intensity Ratio method [9], or whole powder pattern fitting as with full-pattern summation [17,5,4,12]
and Rietveld refinement [16].
Recently, Machine Learning has been used to characterize PXRD spectra [1,6]. Some success has been reported
for phase identification by both conventional Machine Learning models [2,3] as well as Deep Learning architectures
[11,19,10,13,18]. However, few studies have investigated phase quantification [11,10,14].
Previously, we performed PXRD Rietveld characterization of mineral phases for a small batch of urinary stones [8].
Rietveld refinement is a powerful, but time-consuming, pattern fitting procedure which employs least-squares mini-
mization to obtain refined crystallographic parameters for a material, including phase fractions. While full crystallo-
graphic characterization may be useful for research in stone formation, it is not required for a clinical stone analysis
program. Yet, estimated weight fractions serve to aid the analyst in differentiating between primary and secondary
arXiv:2210.10867v1 [cs.LG] 19 Oct 2022