2
drug discovery efforts, we are interested in finding a large
set of distinct favourably-scoring solutions rather than a
single most favourable solution, for two reasons. First,
optimization requires the use of a model scoring function,
often based on experimental data, that can only approx-
imate physical energies (e.g., Rosetta’s REF2015 scor-
ing function [34–37], OSPREY’s energy functions [38],
or molecular dynamics force fields such as CHARMM
[39,40] or AMBER [41,42]). This means that the lowest-
energy state under such a model might not correspond
exactly to the true lowest-energy state, making the set of
slightly higher-scoring states of interest. Second, compu-
tational predictions in drug discovery are best taken as
hypotheses to be tested rather than as reliable facts, and
given that in vitro and in vivo experiments inevitably
have some false negative rate themselves, it is useful to
have a ranked pool of hypotheses (e.g. designed amino
acid sequences likely to perform a particular function, or
predicted docked configurations likely to be maximally
stable) to carry forward for experiments rather than a
single prediction, in order to maximize chances of find-
ing a hit.
Although the importance of both sequence diversity
and molecular structure diversity for drug discovery has
been long understood [43–46], only recently has the di-
versity of sampled solutions from quantum algorithms
been studied. King et al., for example, examined solver
performance with respect to diversity. These authors de-
fined diversity in terms of time-to-all-valleys, a metric
that measures a solver’s ability to sample at least once
from all energy landscape wells containing a ground state
[47]. This metric is specific to problems with many de-
generate (i.e. isoenergetic) ground states. On the other
hand, Mohseni et al. and Zucca et al. studied diversity in
terms of number of near-optimal solutions that have mu-
tual Hamming distance larger than a certain threshold
[48,49]. These studies each showed that the D-Wave
quantum annealing processor can outperform state-of-
the-art classical solvers on certain instances of synthetic
problem classes in sampling diverse, low-energy solutions.
The present study offers two main contributions. First,
we map the multibody molecular docking problem to
a quadratic unconstrained binary optimization (QUBO)
formulation, using a one-hot solution encoding, for eval-
uation by a quantum annealer. The multibody docking
problem occurs repeatedly in drug discovery, both when
predicting structures of oligomeric macromolecular tar-
gets, and when validating designed drugs by predicting
their binding mode to their target (often with accompa-
nying water molecules) [50–52]. Due to the exponential
scaling of the solution space with the number of bodies,
a more efficient means of solving this problem would rep-
resent a major advance for computational drug design
efforts. Second, we provide insight into the ideal tuning
of hyperparameters to permit the quantum annealer to
sample optimal and diverse near-optimal solutions more
efficiently. Specifically, we show that tuning the one-hot
penalty strength to allow intermixing between meaning-
ful (physical) and non-meaningful (non-physical) basis
states can lead to increased sampling of low-energy and
structurally diverse solutions with the D-Wave Advan-
tage quantum annealer.
A. Mapping multibody docking to a quantum
annealer
Here we consider the multibody molecular docking
problem, both as a real-world problem of considerable bi-
ological interest, and as a case study for studying struc-
tural diversity of solutions of a real-world problem as
sampled on a quantum annealer (the D-Wave Advantage
system). This problem involves searching for the optimal
spatial configurations (or “poses”) of N > 2molecules in
a solution space that scales exponentially with N. Of-
ten, this problem is considered at the level of two bod-
ies, as in implicit-solvent protein-ligand docking [53,54].
However, many biological complexes consist of more than
two bodies: for example, hemoglobin is comprised of four
independent protein subunits which closely associate in
complex [55]. Indeed, a large fraction of proteins have
quaternary structure, and nearly all exist at least tran-
siently in complex with other molecules [56]. Even in the
case of protein-ligand docking, water molecules can play
a key role in protein-ligand recognition, so that this is
natively a multibody docking problem [51,57].
Recent classical computational methods [58] such as
HADDOCK [59], ATTRACT [60,61], EROS-DOCK [62]
and MLSD (Multiple Ligands Simultaneous Docking)
[63] attempt to address the importance of the multibody
docking problem, but their performance scales poorly
when a large number of bodies are involved. For exam-
ple, HADDOCK can only carry out simultaneous multi-
body docking for up to six bodies [59]. To deal with the
intensive computational resources required for simultane-
ous multibody docking in the large Nlimit, HADDOCK
instead docks < N bodies in iterative fashion until all
Nbodies are docked. However, this approach fails to
consider the full N-body configuration space, and so is
prone to missing globally-optimal configurations achiev-
able only through simultaneous docking. Quantum paral-
lelism, in contrast, allows simultaneous consideration of
all possible configurations, offering potential advantage
over classical methods.
In this work, we map the multibody molecular dock-
ing problem to a QUBO formulation, allowing sampling
using the D-Wave quantum annealer. The quality of so-
lutions produced by any multibody docking method of
course depends not only on the sampling method but
also the scoring function used. Our approach works for
any energy or scoring function that can be expressed as
a sum of one-body internal energies and two-body in-
teraction energies. For demonstration purposes, we use
the Rosetta REF2015 energy function [36], the excel-
lent accuracy of which allows us to test our predicted
docked configurations against crystallographic data. We