Bayesian phylodynamic inference of population dynamics with dormancy

成果类型:
Article
署名作者:
Cappello, Lorenzo; Lo, Wai Tung 'Jack'; Zhang, Joy Z.; Xu, Peiyu; Barrow, Daniel; Chopra, Ishani; Clark, Andrew G.; Wells, Martin T.; Kim, Jaehee
署名单位:
Pompeu Fabra University; Barcelona School of Economics; Cornell University; Cornell University; Cornell University; Cornell University
刊物名称:
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA
ISSN/ISSBN:
0027-10568
DOI:
10.1073/pnas.2501394122
发表日期:
2025-05-06
关键词:
stress-induced mutagenesis mycobacterium-tuberculosis migration rates mutation-rate dna-repair coalescent HISTORY MODEL birth reproduction
摘要:
Many organisms employ reversible dormancy, or seedbank, in response to environmental fluctuations. This life-history strategy alters fundamental ecoevolutionary forces, leading to distinct patterns of genetic diversity. Two models of dormancy have been proposed based on the average duration of dormancy relative to coalescent timescales: weak seedbank, induced by scheduled seasonality (e.g., plants, invertebrates), and strong seedbank, where individuals stochastically switch between active and dormant states (e.g., bacteria, fungi). The weak seedbank coalescent is statistically equivalent to the Kingman coalescent with a scaled mutation rate, allowing the use of existing inference methods. In contrast, the strong seedbank coalescent differs fundamentally, as only active lineages can coalesce, while dormant lineages cannot. Additionally, dormant individuals typically mutate at a slower rate than active ones. Consequently, despite the significant role of dormancy in the ecoevolutionary dynamics of many organisms, no methods currently exist for inferring population dynamics involving dormancy and associated parameters. We present a Bayesian framework for jointly inferring a latent genealogy, seedbank parameters, and evolutionary parameters from molecular sequence data under the strong seedbank coalescent. We derive the exact probability density of genealogies sampled under the strong seedbank coalescent, characterize the corresponding likelihood function, and present efficient computational algorithms for its evaluation based on our theoretical framework. We develop a tailored Markov chain Monte Carlo sampler and implement our inference framework as a package SeedbankTree within BEAST2. Our work provides both a theoretical foundation and practical inference framework for studying the population genetic and genealogical impacts of dormancy.