TY - JOUR
T1 - An atlas of genetic scores to predict multi-omic traits
AU - Xu, Yu
AU - Ritchie, Scott C
AU - Liang, Yujian
AU - Timmers, Paul R H J
AU - Pietzner, Maik
AU - Lannelongue, Loïc
AU - Lambert, Samuel A
AU - Tahir, Usman A
AU - May-Wilson, Sebastian
AU - Foguet, Carles
AU - Johansson, Åsa
AU - Surendran, Praveen
AU - Nath, Artika P
AU - Persyn, Elodie
AU - Peters, James E
AU - Oliver-Williams, Clare
AU - Deng, Shuliang
AU - Prins, Bram
AU - Luan, Jian'an
AU - Bomba, Lorenzo
AU - Soranzo, Nicole
AU - Di Angelantonio, Emanuele
AU - Pirastu, Nicola
AU - Tai, E Shyong
AU - van Dam, Rob M
AU - Parkinson, Helen
AU - Davenport, Emma E
AU - Paul, Dirk S
AU - Yau, Christopher
AU - Gerszten, Robert E
AU - Mälarstig, Anders
AU - Danesh, John
AU - Sim, Xueling
AU - Langenberg, Claudia
AU - Wilson, James F
AU - Butterworth, Adam S
AU - Inouye, Michael
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Nature Limited.
PY - 2023/3/29
Y1 - 2023/3/29
N2 - The use of omic modalities to dissect the molecular underpinnings of common diseases and traits is becoming increasingly common. But multi-omic traits can be genetically predicted, which enables highly cost-effective and powerful analyses for studies that do not have multi-omics1. Here we examine a large cohort (the INTERVAL study2; n = 50,000 participants) with extensive multi-omic data for plasma proteomics (SomaScan, n = 3,175; Olink, n = 4,822), plasma metabolomics (Metabolon HD4, n = 8,153), serum metabolomics (Nightingale, n = 37,359) and whole-blood Illumina RNA sequencing (n = 4,136), and use machine learning to train genetic scores for 17,227 molecular traits, including 10,521 that reach Bonferroni-adjusted significance. We evaluate the performance of genetic scores through external validation across cohorts of individuals of European, Asian and African American ancestries. In addition, we show the utility of these multi-omic genetic scores by quantifying the genetic control of biological pathways and by generating a synthetic multi-omic dataset of the UK Biobank3 to identify disease associations using a phenome-wide scan. We highlight a series of biological insights with regard to genetic mechanisms in metabolism and canonical pathway associations with disease; for example, JAK-STAT signalling and coronary atherosclerosis. Finally, we develop a portal ( https://www.omicspred.org/ ) to facilitate public access to all genetic scores and validation results, as well as to serve as a platform for future extensions and enhancements of multi-omic genetic scores.
AB - The use of omic modalities to dissect the molecular underpinnings of common diseases and traits is becoming increasingly common. But multi-omic traits can be genetically predicted, which enables highly cost-effective and powerful analyses for studies that do not have multi-omics1. Here we examine a large cohort (the INTERVAL study2; n = 50,000 participants) with extensive multi-omic data for plasma proteomics (SomaScan, n = 3,175; Olink, n = 4,822), plasma metabolomics (Metabolon HD4, n = 8,153), serum metabolomics (Nightingale, n = 37,359) and whole-blood Illumina RNA sequencing (n = 4,136), and use machine learning to train genetic scores for 17,227 molecular traits, including 10,521 that reach Bonferroni-adjusted significance. We evaluate the performance of genetic scores through external validation across cohorts of individuals of European, Asian and African American ancestries. In addition, we show the utility of these multi-omic genetic scores by quantifying the genetic control of biological pathways and by generating a synthetic multi-omic dataset of the UK Biobank3 to identify disease associations using a phenome-wide scan. We highlight a series of biological insights with regard to genetic mechanisms in metabolism and canonical pathway associations with disease; for example, JAK-STAT signalling and coronary atherosclerosis. Finally, we develop a portal ( https://www.omicspred.org/ ) to facilitate public access to all genetic scores and validation results, as well as to serve as a platform for future extensions and enhancements of multi-omic genetic scores.
U2 - 10.1038/s41586-023-05844-9
DO - 10.1038/s41586-023-05844-9
M3 - Article
C2 - 36991119
SN - 0028-0836
VL - 616
SP - 123
EP - 131
JO - Nature
JF - Nature
IS - 7955
ER -