4 KATHARINA T. HUBER AND LIAM J. MAHER
paths from rto the leaves. With this in mind, we say that a ploidy profile ~m= (m1,...,mn),
n≥1 on X={x1,...,xn}is realized by a phylogenetic network Nwith leaf set Xif, for all
1≤i≤n, the number of directed paths from the root of Nto xiis mi. For example, both
phylogenetic networks pictured in Figure 1realize the ploidy profile ~m= (14,12,12,10)
indexed by X={x1,x2,x3,x4}.
Contributing to the emerging field of Polyploid Phylogenetics [29], a first inroad into our
question was made in [11] by studying the hybrid number of a ploidy profile ~m, that is, the
minimal number of polyploidization events required by a phylogenetic network to realize
~m. As it turns out, the arguments underlying the results in [11] largely rely on a certain
iteratively constructed network that realizes ~m. Denoting for a choice Cof initializing
network the generated network by N(~m) = NC(~m)and changing the network initializing
that construction in a way that does not affect the main findings in [11] (see below for
details), we show that even more can be said about ploidy profiles. For example, our first
result (Proposition 4.1) shows that N(~m)may be thought of as a generator of ploidy profile
space (defined in a similar way as phylogenetic tree space) in the sense that any realization
of ~mcan be reached from N(~m)via a number of multiple-labelled tree editing operations
and reticulation vertex splitting operations. As an immediate consequence of this we obtain
a distance measure for phylogenetic networks that realize one and the same ploidy profile.
On a more speculative level it might be interesting to see if N(~m)lends itself as a useful
prior for a Bayesian method along the lines as described in [35].
Our second result (Theorem 6.1) shows that a key concept introduced in [11] called the
simplification sequence of a ploidy profile ~mis in fact closely related to the notion of a
cherry reduction sequence [7] for N(~m), also called a cherry picking sequence in [19]. In
case autopolyploidy is not suspected to have played a role in the evolution of a dataset, this
implies that the network N(~m)can also be reconstructed from phylogenetic networks on
three leaves called trinets [32]. These can be obtained from a dataset using, for example,
the TriLoNet software [26].
Exemplified in terms of the phylogenetic network depicted in Figure 1(ii) for the ploidy
profile (14,12,12,10), our third result (Theorem 6.2) implies that for any ploidy profile
we can always find a phylogenetic network realizing it in the form of a phylogenetic tree
that potentially contains beads to which additional arcs have been added and at most one
of those arcs is not horizontal. In the context of this it is important to note that, in general,
a phylogenetic network cannot be thought of as a phylogenetic tree with additional arcs let
alone horizontal ones. The reason for this is that horizontal arcs imply that the ancestral
taxa joined by such an arc must have existed at the same time (see also [33, Section 10.3.3]
for more on this and the Viola dataset below for an example).
The remainder of the paper is organized as follows. In the next section, we review rel-
evant basic terminology surrounding graphs, phylogenetic networks and ploidy profiles.
For a ploidy profile ~m, we outline the construction of the network N(~m)in Section 3. This
includes the definition of the simplification sequence for ~m. Subsequent to this, we intro-
duce ploidy profile space in Section 4and also establish Proposition 4.1 in that section.
Sections 6is concerned with establishing Theorems 6.1 and 6.2. To do this, we use The-
orem 5.1 which we establish in Section 5. That theorem is underpinned by the concept
of a so called HGT-consistent labelling introduced in [37], a notion that we extend to our
types of phylogenetic networks here. In the last but one section, we employ a simplified