A Comparison of Models to Infer the Distribution of Fitness Effects of New Mutations

Athanasios Kousathanas, Peter D Keightley

Research output: Contribution to journalArticlepeer-review

Abstract

Knowing the distribution of fitness effects (DFE) of new mutations is important for several topics in evolutionary genetics. Existing computational methods to infer the DFE from DNA polymorphism data have frequently assumed that the DFE can be approximated by a unimodal distribution, such as a lognormal or a gamma distribution. However, if the true DFE departs substantially from the assumed distribution (e.g., if the DFE is multimodal), this could lead to misleading inferences about its properties. We conducted simulations to test the performance of parametric and non-parametric discretised distribution models to infer the properties of the DFE for cases in which the true DFE is unimodal, bimodal or multimodal. We found that lognormal and gamma distribution models can perform poorly in recovering the properties of the distribution if the true DFE is bimodal or multimodal, whereas the discretised distribution models provide a better fit. If there is a sufficient amount of data, the discretised models can be used to detect multimodality of the DFE and to accurately infer the mean effect and the average fixation probability of a new deleterious mutation. We fitted several models for the DFE of amino acid-changing mutations using whole-genome data from Drosophila melanogaster and the house mouse subspecies Mus musculus castaneus. A lognormal DFE best explains the data for D. melanogaster, whereas we find evidence for a bimodal DFE in M. m. castaneus.
Original languageEnglish
Pages (from-to)1197-1208
Number of pages12
JournalGenetics
Volume193
Issue number4
DOIs
Publication statusPublished - 1 Apr 2013

Fingerprint

Dive into the research topics of 'A Comparison of Models to Infer the Distribution of Fitness Effects of New Mutations'. Together they form a unique fingerprint.

Cite this