
KCL-PH-TH/2022-41
Dictionary learning: a novel approach to detecting binary black holes in the presence
of Galactic noise with LISA
Charles Badger,1Katarina Martinovic,1Alejandro Torres-Forn´e,2, 3 Mairi Sakellariadou,1and Jos´e A. Font2, 3
1Theoretical Particle Physics and Cosmology Group, Physics Department,
King’s College London, University of London, Strand, London WC2R 2LS, United Kingdom
2Departamento de Astronom´ıa y Astrof´ısica, Universitat de Val`encia,
Dr. Moliner 50, 46100 Burjassot (Val`encia), Spain
3Observatori Astron`omic, Universitat de Val`encia,
Catedr´atico Jos´e Beltr´an 2, 46980 Paterna (Val`encia), Spain
(Dated: October 17, 2022)
The noise produced by the inspiral of millions of white dwarf binaries in the Milky Way may pose a
threat to one of the main goals of the space-based LISA mission: the detection of massive black hole
binary mergers. We present a novel study for reconstruction of merger waveforms in the presence of
Galactic confusion noise using dictionary learning. We discuss the limitations of untangling signals
from binaries with total mass from 102Mto 104M. Our method proves extremely successful for
binaries with total mass greater than ∼3×103Mup to redshift 3 in conservative scenarios, and
up to redshift 7.5 in optimistic scenarios. In addition, consistently good waveform reconstruction of
merger events is found if the signal-to-noise ratio is approximately 5 or greater.
Introduction.— The LIGO/Virgo interferometer net-
work [1, 2] has already detected gravitational waves
(GWs) from almost one hundred compact binary coa-
lescence (CBC) events [3–5]. These detections populate
GW transient catalogues and reveal information about
properties of the underlying black hole and neutron star
populations [6]. The high frequency range probed by the
current terrestrial detectors, at around (10 −1000) Hz,
is sensitive to stellar-mass binaries that mostly lie be-
low the pair-instability supernova mass gap1. Mergers of
supermassive black holes, however, are expected to emit
low-frequency (mHz) gravitational waves.
The space-based GW interferometer LISA, anticipated
to be launched in the mid-2030s, will be sensitive to GWs
in the mHz range [8]. Other GW sources will be de-
tectable at these frequencies: Galactic white dwarf bi-
naries, inspiraling binaries with extreme mass-ratio, or
colliding true vacuum bubbles formed at the electroweak
phase transition [9–11]. The tens of millions of double
white dwarf binaries in the Galaxy could have an impact
on detectability of massive black hole binaries coalescing
in the LISA frequency band [12]. LISA will observe con-
tinuous GWs from inspiraling white dwarfs, and although
it may be sensitive to individual sources, most will remain
unresolved and these are referred to as Galactic confusion
noise [13–15]. It has been shown that modulation of the
Galactic foreground from the LISA orbit could lead to
a reduction in signal-to-noise ratio (SNR) of other GW
sources by a factor of 4 [16]. A LISA Data Challenge2
is underway to study the impact of overlapping Galactic
1GW190521 is the only exceptional event where the mass of the
primary black hole is unambiguously in the mass gap [7].
2https://lisa-ldc.lal.in2p3.fr.
sources on the sensitivity to massive black hole merg-
ers [17], and attempts to separate the foreground from
other GW sources have been conducted [18–22].
In this Letter we apply a dictionary learning method
to separate CBCs from the Galactic foreground in the
LISA frequency band. Such a method has been success-
fully applied in GW data analysis to classify and denoise
Advanced LIGO’s “blip” noise transients [23] and effec-
tively improve the performance of the detector. More
precisely, we assess the suitability of the dictionary learn-
ing method for the classification and reconstruction of
massive binary black hole merger signals in the presence
of Galactic noise.
Previous studies focused on the inspiral of loud CBC
sources and demonstrated that SNR accumulated over
time is sufficiently large to overcome the noise from
Galactic binaries [24]. In other literature, detectability of
CBCs was investigated for equal-mass and non-spinning
binaries [25–27], confirming the largest SNR is expected
from binaries with combined mass ∼(105−106)M. In
particular, Fig. 3 in [27] presents two mass ranges with
low SNR that could be affected by Galactic foreground,
namely (102−104)Mand (107−109)M.
Here we consider all of the mass ranges, along with
varying spins and redshifts, and we study their wave-
forms around coalescence time. The dictionary learning
method reconstructs CBC signals with ease in the trivial
case where the CBCs are above the Galactic noise, i.e.
for the (105−106)Mmass range. We find the dictionary
learning method to be too computationally expensive for
very heavy mergers in the range (107−109)M. However,
our method succeeds in separating low-SNR binaries in
the range (102−104)Mfrom the Galactic noise. Hence,
the dictionary learning method could significantly assist
the detection of this prime LISA source [28].
arXiv:2210.06194v2 [gr-qc] 14 Oct 2022