
Introduction
The fitness landscape is a very influential metaphor introduced by Wright [1, Fig 2] to
describe evolution as explorations through a “field of possible genes combinations”,
where high values of fitness are represented as peaks separated by valleys of lower
fitness. The topography of the fitness landscape has important evolutionary
consequences, e.g. speciation via reproductive isolation [2]. Within this framework, the
evolution of any population can be conceptualised as adaptive walks driven by
successive mutations constrained by incremental or neutral fitness steps. Thus, in the
absence of additional evolutionary forces such as drift or environmental variations, once
a population reaches a local peak, natural selection hampers any further mutational
paths that decrease fitness. However, there are empirical evidences showing that
populations do not stop indefinitely at a local peak and can explore alternative
trajectories on the landscape [3–7], ergo, the following question arises: How does
evolution escape from a local peak to a fitter one?
Since it has been formulated, considerable progress have been made on this
question [8
–
22]. However, conventional theoretical approaches still state that a genotype
mutates into another through point mutations (e.g. single-nucleotide variations). If one
takes a look at the molecular scale of DNA and the different mutation types, this may
seem contradictory as it is well known that many other kinds of variation operators
(including insertions, deletions, duplications, translocations and inversions) act on the
genome. Hence, a fundamental aspect of this challenge is to understand—at the scale of
molecular evolution—the roles played by these different mutation types. Indeed, there is
a gap between the theoretical models, that account for a very limited set of mutations
types—typically only point mutations—and the reality of molecular evolution, where
multiple variation operators act on the sequence.
As a contribution to bridge this gap, we present a minimal DNA-inspired
mechanistic model for inversion mutations, and explore their relationship with the
escape dynamics from local fitness peaks. Inversion mutations are one of the most
frequent chromosomal rearrangements [23, Ch. 17.2] with lengths covering a wide range
of sizes. For example, in Long Term Evolution Experiments with E. coli, chromosomal
rearrangements have been characterised by optical mapping (hence limiting the
resolution to rearrangements larger than 5000 bp) [24]. In this study, 75% of evolved
populations showed inversion events—ranging in size from ∼164 Kb to ∼1.8 Mb [24]
(for other examples of large inversions in different clades, see [25, Table 1 ]). With the
development of novel sequencing technologies [26–28], it has been possible to identify
intragenic (submicroscopic) DNA inversions, for example, an inversion of seven
nucleotides in mitochondrial DNA, resulting in the alteration of three amino acids and
associated to an unusual mitochondrial disorder [29, Fig 1]. Intragenic inversions have
also been suggested to be an important mechanism implied in the evolution of
eukaryotic cells [30]. Although chromosomal inversions are ubiquitous in many
evolutionary processes [23–27,29–38], very little is known about their theoretical
description and computational simulation at the sequence level, as models generally
focus on very large inversions (typically larger than a single gene), hence on their effect
on synteny (or the deleterious effects at breakpoints), but neglect the possibility that
small inversions occur inside coding sequences [39].
Here, we simulate a representation of molecular evolution of digital organisms
(replicators), each of which contains a single piece of DNA. We engineer a
computational method to cartoon the double-stranded structure of DNA, and simulate
inversion-like mutations consisting of a permutation of a segment of the complementary
strand, which is then exchanged with the main strand segment (see Methods, schema 6).
For the sake of simplicity, we consider digital genotypes made up of binary nucleotides
(i.e. a binary alphabet {0,1}instead of the four-nucleotides alphabet {A,T,C,G}). The
2/35