Improving Bayesian Population Dynamics Inference: A Coalescent-Based Model for Multiple Loci

Mandev S. Gill, Philippe Lemey, Nuno R. Faria, Andrew Rambaut, Beth Shapiro, Marc A. Suchard*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Effective population size is fundamental in population genetics and characterizes genetic diversity. To infer past population dynamics from molecular sequence data, coalescent-based models have been developed for Bayesian nonparametric estimation of effective population size over time. Among the most successful is a Gaussian Markov random field (GMRF) model for a single gene locus. Here, we present a generalization of the GMRF model that allows for the analysis of multilocus sequence data. Using simulated data, we demonstrate the improved performance of our method to recover true population trajectories and the time to the most recent common ancestor (TMRCA). We analyze a multilocus alignment of HIV-1 CRF02_AG gene sequences sampled from Cameroon. Our results are consistent with HIV prevalence data and uncover some aspects of the population history that go undetected in Bayesian parametric estimation. Finally, we recover an older and more reconcilable TMRCA for a classic ancient DNA data set.

Original languageEnglish
Pages (from-to)713-724
Number of pages12
JournalMolecular Biology and Evolution
Volume30
Issue number3
DOIs
Publication statusPublished - Mar 2013

Keywords

  • coalescent
  • smoothing
  • effective population size
  • Gaussian Markov random fields

Cite this