Genomic Prediction Using Individual-Level Data and Summary Statistics from Multiple Populations

Jeremie Vandenplas, Mario P L Calus, Gregor Gorjanc

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

This study presents a method for genomic prediction that uses individual-level data and summary statistics from multiple populations. Genome-wide markers are nowadays widely used to predict complex traits, and genomic prediction using multi-population data is an appealing approach to achieve higher prediction accuracies. However, sharing of individual-level data across populations is not always possible. We present a method that enables integration of summary statistics from separate analyses with the available individual-level data. The data can either consist of individuals with single or multiple (weighted) phenotype records per individual. We developed a method based on a hypothetical joint analysis model and absorption of population specific information. We show that population specific information is fully captured by estimated allele substitution effects and the accuracy of those estimates, i.e. the summary statistics. The method gives identical result as the joint analysis of all individual-level data when complete summary statistics are available. We provide a series of easy-to-use approximations that can be used when complete summary statistics are not available or impractical to share. Simulations show that approximations enables integration of different sources of information across a wide range of settings yielding accurate predictions. The method can be readily extended to multiple-traits. In summary, the developed method enables integration of genome-wide data in the individual-level or summary statistics form from multiple populations to obtain more accurate estimates of allele substitution effects and genomic predictions.

Original languageEnglish
Pages (from-to)53-69
JournalGenetics
Volume210
Issue number1
Early online date18 Jul 2018
DOIs
Publication statusPublished - 1 Sept 2018

Keywords / Materials (for Non-textual outputs)

  • meta-analysis
  • quantitative trait
  • statistical method

Fingerprint

Dive into the research topics of 'Genomic Prediction Using Individual-Level Data and Summary Statistics from Multiple Populations'. Together they form a unique fingerprint.

Cite this