Coalescence and sampling distributions for Feller diffusions

2025-04-29 0 0 789.14KB 32 页 10玖币
侵权投诉
Coalescence and sampling distributions for Feller
diffusions
Conrad J. Burdena, Robert C. Griffithsb
aMathematical Sciences Institute, Australian National University, Canberra, Australia
bSchool of Mathematics, Monash University, Australia
Abstract
Consider the diffusion process defined by the forward equation ut(t, x) =
1
2{xu(t, x)}xx α{xu(t, x)}xfor t, x 0 and −∞ < α < , with an initial
condition u(0, x) = δ(xx0). This equation was introduced and solved by
Feller to model the growth of a population of independently reproducing
individuals. We explore important coalescent processes related to Feller’s
solution. For any αand x0>0 we calculate the distribution of the random
variable An(s;t), defined as the finite number of ancestors at a time sin
the past of a sample of size ntaken from the infinite population of a Feller
diffusion at a time tsince since its initiation. In a subcritical diffusion we
find the distribution of population and sample coalescent trees from time t
back, conditional on non-extinction as t→ ∞. In a supercritical diffusion
we construct a coalescent tree which has a single founder and derive the
distribution of coalescent times.
Keywords: Coalescent, Diffusion process, Branching process, Feller
diffusion, Sampling distributions
1. Introduction
Feller (1939, 1951a, Section 5) introduced the process governed by the
diffusion equation
ut(t, x) = 1
2{xu(t, x)}xx α{xu(t, x)}x,0x < , α R,(1)
Email addresses: conrad.burden@anu.edu.au (Conrad J. Burden),
Bob.Griffiths@Monash.edu (Robert C. Griffiths)
Preprint submitted to Theoretical Population Biology September 13, 2023
arXiv:2210.12894v2 [math.PR] 12 Sep 2023
as the continuum analogue of a classical branching process, primarily with
the intention of modelling the growth of a population of statistically inde-
pendently reproducing individuals. This stochastic process has subsequently
found broader applications in biology, genetics, ecology, nuclear physics, sta-
tistical physics, seismology and finance (see Gan and Waxman (2015, Section
I) for a detailed list of references).
In the present paper we are interested in the Feller diffusion as the contin-
uum limit of a Bienaym´e-Galton-Watson (BGW) process, and applications
to population genetics including sampling distributions of populations under-
going neutral mutations between multiple types. Our approach is based on
the coalescent. In common with the diffusion limit of the Wright-Fisher and
Moran coalescent trees, the coalescent of a Feller diffusion has the property
that it ‘comes down from infinity’. That is to say, the entire, uncountably
infinite, population at any time chosen as the present has a finite number
of ancestral lineages at any positive time in the past (see Fig. 1). The key
step in our analysis is Theorem 1 in which we calculate the distribution of
the discrete random variable A(s;t), defined to be the number of ancestors
at an earlier time tsof the entire population at the current time tsince
initiation of the process.
Starting from this theorem we follow two paths. In the first we consider
the limit t→ ∞ at fixed sof a subcritical Feller diffusion conditioned on sur-
vival of the population. Results for expected inter-coalescent waiting times
for a random sample agree with a recent, less straightforward derivation using
the Laplace transform by Burden and Griffiths (2023).
The second path involves taking the limit s0 at fixed t. At any
s > 0 we find that the ancestry of the process over the interval [0, t s] is
equivalent to the ancestry of a homogeneous birth-death (BD) process with
s-dependent birth and death rates. Both of these rates become infinite as
s0 while their difference remains constant and equal to the parameter
α. Useful tools for characterising the properties of coalescent trees generated
by a homogeneous birth-death (BD) process are the reconstructed process
(RP) (Nee et al., 1994) and the reversed-reconstructed process (RRP) (Al-
dous and Popovic, 2005; Gernhard, 2008; Stadler, 2009; Wiuf, 2018; Ignatieva
et al., 2020), which assumes an improper prior on the time since initiation
of the process, starting with a single ancestor. We construct the coalescent
tree for a Feller diffusion as the limit of a RP from a linear BD process, and
from this construct a RRP for the supercritical process and hence calculate
the distribution of coalescent times.
2
time
xx x x xx
founders at time 0
exponentially distributed families at time t
X(0) = x0
X(t)
Figure 1: Structure of the ancestral tree of a Feller diffusion conditioned on X(0) = x0.
The ancestry of the population X(t) “comes down from infinity”, that is, the number of
ancestors (black lines) of the currently observed infinite population at any finite time in the
past is finite. The blue lines either side represent the entire, infinite population at earlier
times, most of whose descendant lines become extinct by time t. The final population is
the sum of a Poisson number of independent exponentially distributed family sizes (see
Eq. (7)).
Previously Burden and Simon (2016) and Burden and Soewongsono (2019)
have studied coalescence of the supercritical Feller diffusion from a frequen-
tist point of view, in the sense that they calculate confidence intervals on the
time since initiation and initial parameters of the Feller diffusion in terms of
a later observation. The current paper is complementary in that the analysis
of the supercritical diffusion is Bayesian in that the RRP requires a uniform
prior on the time since initiation of the ancestral tree.
The Feller diffusion is implicit within the near-critical continuum limit of a
continuous-time BGW process studied by O’Connell (1995) and Harris et al.
(2020), who also calculate distributions of coalescent times. However their
analysis differs in that it assumes a process initiated by a single progenitor,
their near-critical limit consists of taking the time since initiation to infinity
while keeping the branching time and reproduction rate constant, and results
3
are stated in terms of the fraction of time since initiation. These differences
make direct comparison with the current work problematic, though within
the text of this paper we will note connections with O’Connell (1995) where
appropriate.
Crespo et al. (2021) analyse coalescence in the infinite population limit
of a RRP constructed from a BD process. Their limiting process differs from
the s0 diffusion limit in the current paper in that the time axis is rescaled.
Numerical simulations of their coalescent model employ further approxima-
tions which amount to setting the population size as a deterministic function
rather a stochastic process.
The layout of the paper is as follows. In Section 2 we summarise prop-
erties of the Feller diffusion including its connection with continuum limits
of BD and BGW processes and the interpretation of Feller’s solution. This
establishes known results and a notation required for subsequent sections. In
Section 3 distributions are derived for An(s, t), defined to be the number of
ancestors at an earlier time tsof the entire population at the current time
tsince initiation of the process, and for the corresponding count A(s;t) of
the number of ancestors of the entire population at time t. The limit t→ ∞
at fixed sof a sub-critical Feller diffusion conditioned on survival and the
quasi-stationary sampling distribution are dealt with in Section 4. The limit
s0 at fixed tand the connection between the RP of a BD process and the
coalescent tree of a Feller diffusion are dealt with in Section 5. In Section 6
the coalescent tree for the entire population generated by a a supercritical
Feller diffusion is constructed as a RRP, from which the distributions of co-
alescent times for the population and for a finite sample are calculated in
Section 7. Results are summarised in Section 8.
2. Properties of the Feller Diffusion
For any bounded continuous function gwith second derivatives existing,
a standard backward Kolmogorov equation for a stochastic process X(t) is
d
dtE[g(X(t))] = E[Lg(X(t))] ,
where the right side is the expectation of the function hdefined by h=Lg.
The operator Lis referred to as the generator of the process. In general, if a
4
continuous random variable X(t) evolves so that
E[X(t+δt)X(t)|X(t) = x] = a(x)δt +o(δt),
E(X(t+δt)X(t))2|X(t) = x=b(x)δt +o(δt),
E(X(t+δt)X(t))k|X(t) = x=o(δt), k 3,
(2)
as δt 0, then the generator takes the form (Karlin and Taylor, 1981, p214;
Ewens, 2004, Chapter 4)
L=1
2b(x)2
x2+a(x)
x.
For the particular case a(x) = αx,b(x) = xwe refer to the process with
generator
L=1
2x2
x2+αx
x,(3)
as a Feller diffusion (Feller, 1951a,b). The process is said to be sub-critical,
critical or super-critical if α < 0, α= 0 or α > 0 respectively. A Feller
diffusion can arise as a limit of a BD process or as a limit of a BGW process.
2.1. Feller Diffusion as the limit of a BD process
Baake and Wakolbinger (2015) argue that Feller (1951a) regarded Eq. (1)
primarily in terms of the large-population, near-critical limit of a continuous-
time BD process. In Proposition 1 we set out details of how this limit scales,
using notation which will prove useful in Section 5.
Consider a linear BD process with birth rate ˆ
λ(ϵ) and death rate ˆµ(ϵ)
specified as functions of a parameter ϵR0with the property
ˆ
λ(ϵ) = 1
2ϵ1+1
2α+O(ϵ),
ˆµ(ϵ) = 1
2ϵ11
2α+O(ϵ),
as ϵ0. If Mϵ(t) is the number of particles alive at time t, then for
n, y = 0,1,2, . . .,
pn,y := P(Mϵ(t+δt) = y|Mϵ(t) = n)
=
nˆ
λδt +O(δt2), y =n+ 1;
1n(ˆ
λ+ ˆµ)δt +O(δt2), y =n;
nˆµδt +O(δt2), y =n1;
O(δt2),otherwise.
(4)
5
摘要:

CoalescenceandsamplingdistributionsforFellerdiffusionsConradJ.Burdena,RobertC.GriffithsbaMathematicalSciencesInstitute,AustralianNationalUniversity,Canberra,AustraliabSchoolofMathematics,MonashUniversity,AustraliaAbstractConsiderthediffusionprocessdefinedbytheforwardequationut(t,x)=12{xu(t,x)}xx−α{x...

展开>> 收起<<
Coalescence and sampling distributions for Feller diffusions.pdf

共32页,预览5页

还剩页未读, 继续阅读

声明:本站为文档C2C交易模式,即用户上传的文档直接被用户下载,本站只是中间服务平台,本站所有文档下载所得的收益归上传人(含作者)所有。玖贝云文库仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对上载内容本身不做任何修改或编辑。若文档所含内容侵犯了您的版权或隐私,请立即通知玖贝云文库,我们立即给予删除!
分类:图书资源 价格:10玖币 属性:32 页 大小:789.14KB 格式:PDF 时间:2025-04-29

开通VIP享超值会员特权

  • 多端同步记录
  • 高速下载文档
  • 免费文档工具
  • 分享文档赚钱
  • 每日登录抽奖
  • 优质衍生服务
/ 32
客服
关注