
Model exploration in gravitational-wave astronomy with the
maximum population likelihood
Ethan Payne1, 2, 3, 4, aand Eric Thrane3, 4, b
1Department of Physics, California Institute of Technology, Pasadena, California 91125, USA
2LIGO Laboratory, California Institute of Technology, Pasadena, California 91125, USA
3School of Physics and Astronomy, Monash University, VIC 3800, Australia
4OzGrav: The ARC Centre of Excellence for Gravitational-Wave Discovery, Clayton, VIC 3800, Australia
Hierarchical Bayesian inference is an essential tool for studying the population properties of com-
pact binaries with gravitational waves. The basic premise is to infer the unknown prior distribution
of binary black hole and/or neutron star parameters such component masses, spin vectors, and
redshift. These distributions shed light on the fate of massive stars, how and where binaries are
assembled, and the evolution of the Universe over cosmic time. Hierarchical analyses model the
binary black hole population using a prior distribution conditioned on hyper-parameters, which are
inferred from the data. However, a misspecified model can lead to faulty astrophysical inferences.
In this paper we answer the question: given some data, which prior distribution––from the set of all
possible prior distributions––produces the largest possible population likelihood? This distribution
(which is not a true prior) is –
π(pronounced “pi stroke”), and the associated maximum population
likelihood is –
L(pronounced “L stroke”). The structure of –
πis a linear superposition of delta func-
tions, a result which follows from Carath´eodory’s theorem. We show how –
πand –
Lcan be used for
model exploration/criticism. We apply this –
Lformalism to study the population of binary black hole
mergers observed in LIGO–Virgo–KAGRA’s third Gravitational-Wave Transient Catalog. Based on
our results, we discuss possible improvements for gravitational-wave population models.
I. MOTIVATION
Bayesian inference has become a mainstay of mod-
ern scientific data analysis as a means of analysing sig-
nals in noisy observations. This procedure determines
the posterior distributions for parameters given one or
more model. In order to study the population prop-
erties of a set of uncertain observations, a hierarchi-
cal Bayesian framework can be employed. The basic
idea is to model the population using a conditional prior
π(θ|Λ, M), which describes, for example, the distribution
of black hole masses {m1, m2} ∈ θgiven some hyper-
parameters Λ, which determine the shape of the prior
distribution. Here, Mdenotes the choice of model. One
then carries out Bayesian inference using a “population
likelihood”
L(d|Λ, M) =
N
Y
i
1
ξ(Λ) ZdθiL(di|θi)π(θi|Λ, M),(1)
where L(di|θi) is the likelihood for data associated with
event igiven parameters θi, and ξ(Λ) is the detected
fraction for a choice of hyper-parameters. Meanwhile,
Nis the total number of observations. For an overview
of hierarchical modeling in gravitational-wave astronomy
including selection effects, see Refs. [1–3].
The LIGO-Virgo-KAGRA (LVK) Collaboration’s
third gravitational-wave transient catalog (GWTC-3) [4]
contains the cumulative set of observations of N= 69
aepayne@caltech.edu
beric.thrane@monash.edu
confident binary black-hole mergers [5] detected by the
LVK [6–8]. Additional detection candidates have been
put forward by independent groups [9–13]. Hierarchical
inference is employed to study the population properties
these merging binary black holes; see, e.g., Refs. [14–32].
These analyses have revealed a number of exciting re-
sults, such as the surprising excess rate of mergers with
a primary black hole mass of ∼35 M[15], and the evo-
lution of the binary merger rate with redshift [16], to
name just two.
However, Bayesian inference has its limitations. One
can use Eq. (1) in order to infer the distribution of binary
black hole parameters—given some model ; and one can
compare the marginal likelihoods of two models to see
which one better describes the data. However, Bayesian
inference does not tell us if any of the models we are using
are suitable descriptions of the data. While all models for
the distribution of binary black hole parameters are likely
to be imperfect, some may be adequate for describing our
current dataset [33]. When a model fails to capture some
salient feature of the data, it is said to be “misspecified”
[34,35]. Some effort has been made to assess the suit-
ability of gravitational-wave models, both qualitatively
and quantitatively; see, e.g., [15,16,34,36]. However,
the idea of “model criticism”—testing the suitability of
Bayesian models—is still being developed within the con-
text of gravitational-wave astronomy and beyond.
Hierarchical Bayesian inference studies often depend
upon parametric models. Modelers design parameteri-
zations in order to capture the key features of the as-
trophysical distributions. However, one must still worry
about “unknown unknowns”—features which do not oc-
cur to the modeler to add. For example, recent stud-
ies [15,16,37,38] find a sub-population of binary black
arXiv:2210.11641v2 [astro-ph.IM] 22 Feb 2023