Reproducibility of preclinical animal research improves with heterogeneity of study samples

Bernhard Voelkl, Lucile Vogt, Emily S Sena, Hanno Würbel

Research output: Contribution to journal › Article › peer-review


Single-laboratory studies conducted under highly standardized conditions are the gold standard in preclinical animal research. Using simulations based on 440 preclinical studies across 13 different interventions in animal models of stroke, myocardial infarction, and breast cancer, we compared the accuracy of effect size estimates between single-laboratory and multi-laboratory study designs. Single-laboratory studies generally failed to predict effect size accurately, and larger sample sizes rendered effect size estimates even less accurate. By contrast, multi-laboratory designs including as few as 2 to 4 laboratories increased coverage probability by up to 42 percentage points without a need for larger sample sizes. These findings demonstrate that within-study standardization is a major cause of poor reproducibility. More representative study samples are required to improve the external validity and reproducibility of preclinical animal research and to prevent wasting animals and resources for inconclusive research.
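The relationship between laboratory count, sample size, and coverage probability described above can be illustrated with a toy simulation. This is a minimal sketch and not the authors' procedure: the function name `coverage_probability`, the parameter values (`mu`, `tau`, `sigma`), the normal data model, and the naive pooled analysis with a normal-approximation 95% interval are all assumptions chosen for illustration. Each simulated laboratory gets its own true effect drawn around the overall effect `mu`, animals are split evenly across laboratories, and coverage is the fraction of simulated studies whose interval contains `mu`.

```python
import random
from statistics import NormalDist, mean, variance


def coverage_probability(n_labs, n_per_arm, mu=0.5, tau=0.5, sigma=1.0,
                         n_sims=2000, seed=1):
    """Fraction of simulated studies whose 95% CI contains the overall effect mu.

    Hypothetical model (an assumption, not taken from the paper):
    - each laboratory's true effect is drawn from N(mu, tau^2)
    - individual animals vary around their group mean with SD sigma
    - n_per_arm animals per treatment arm are split evenly across n_labs
    - analysis pools all animals and ignores laboratory identity
    """
    if n_per_arm % n_labs != 0:
        raise ValueError("n_per_arm must be divisible by n_labs")
    rng = random.Random(seed)
    z = NormalDist().inv_cdf(0.975)  # normal-approximation critical value
    n_per_lab = n_per_arm // n_labs
    hits = 0
    for _ in range(n_sims):
        control, treated = [], []
        for _ in range(n_labs):
            lab_effect = rng.gauss(mu, tau)  # laboratory-specific true effect
            control += [rng.gauss(0.0, sigma) for _ in range(n_per_lab)]
            treated += [rng.gauss(lab_effect, sigma) for _ in range(n_per_lab)]
        estimate = mean(treated) - mean(control)
        std_err = ((variance(control) + variance(treated)) / n_per_arm) ** 0.5
        if abs(estimate - mu) <= z * std_err:
            hits += 1
    return hits / n_sims


if __name__ == "__main__":
    # Same total sample size, different numbers of laboratories.
    for labs in (1, 2, 4):
        print(labs, "lab(s), 12 per arm:", coverage_probability(labs, 12))
    # Single laboratory with a larger sample.
    print("1 lab, 48 per arm:", coverage_probability(1, 48))
```

Under these assumptions, a single-laboratory interval narrows around that laboratory's own effect as the sample grows, so it misses the overall effect more often, while splitting the same number of animals across laboratories averages over laboratory-specific effects.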

Original language: English
Pages (from-to): e2003693
Journal: PLoS Biology
Issue number: 2
Publication status: Published - 22 Feb 2018


  • Journal Article

