TY - JOUR
T1 - Global Biobank Meta-analysis Initiative: powering genetic discovery across human disease
AU - Hayward, Caroline
AU - Campbell, Archie
AU - Porteous, David John
AU - Richmond , Anne
AU - Marioni, Riccardo E
N1 - Funding Information:
The work of the contributing biobanks was supported by numerous grants from governmental and charitable bodies. Biobank-specific acknowledgments and more detailed acknowledgments are included in Data S2. Initiative management, S.B.C. J.C. N.J.C. M.J.D. E.E.K. A.R.M. B.M.N. Y.O. A.V.P. D.A.v.H. R.G.W. C.J.W. W.Z. and S.Z.; individual biobank analysis, A.B. Y.B. B.M.B. C.D.B. S.C. T.-T.C. K.C. S.M.D. M.D. G.H.d.B. Y.D. N.J.D. M.-J.F. Y.-C.A.F. S.F. V.L.F. L.G.F. E.R.G. T.R.G. D.H.G. C.R.G. G.G.-A. S.E.G. L.A.G. C.H. J.B.H. W.E.H. H.H. K.H. N.I. A.I. R.J. M. Kurki, J.K. N.K. E.E.K. J.T.K. M. Kanai, T.L. K.L. M.H.L. S.L. K.L. Y.-F.L. V.L.F. R.J.F.L. E.A.L.-M. A.R.-M. S.M.-G. R.M. R.E.M. H.C.M. A.R.M. Y.M. H.M. S.E.M. I.Y.M. B.M. S.M. K.N. S.N. M.A.N.-A. K.N. Y.O. P.P. A.L.-P. A.P. B.P. S.P. M.H.P. D.J.R. N.R. M.D.R. A.R. C.S. S.S. S.S.S. J.A.S. P.S. I.S. T.T. R.T. K.T. J.U. D.A.v.H. B.V. M.V. Y.V. J.M.V. R.G.W. Y.W. S.J.W. B.N.W. K.-H.H.W. M.Z. X.Z. and S.Z.; individual biobank management, N.A. A.A.T. K.M.A.-D. P.A. K.C.B. M. Boehnke, M. Boezen, C.D.B. A.C. Z.C. C.-Y.C. J.C. N.J.C. S.M.D. S.F. Y.-C.A.F. S.F. E.F. T.G. C.R.G. C.J.G. Y.G. H.H. K.A.H. K.H. S.I.I. N.M.J. N.K. E.E.K. J.T.K. C.L. M.H.L. M.T.M.L. L.L. K.L. Y.-F.L. R.J.F.L. J.L. S.M. Y.M. K.M. I.Y.M. Y.O. C.M.O. A.V.P. B.P. D.J.P. D.J.R. M.D.R. S.S. J.W.S. H.S. K.S. T.T. U.T. R.C.T. D.A.v.H. M.V. R.G.W. D.C.W. C.W. J.W. M.Z. X.Z. and S.Z.; study design and interpretation of results, A.B. M. Boehnke, M. Boezen, B.M.B. T.-T.C. C.-Y.C. M.J.D. G.D.S. N.J.D. S.F. M.-J.F. H.K.F. E.R.G. A.G. T.G. J.B.H. J.H. K.H. R.J. M.K. E.E.K. T.K. C.M.L. V.L.F. E.A.L.-M. A.R.M. S.N. B.M.N. C.M.O. J.J.P. B.P. N.R. H.R. J.A.S. I.S. K.T. D.A.v.H. R.G.W. Y.W. D.C.W. S.J.W. C.J.W. B.N.W. J.W. K.-H.H.W. M.Z. H.Z. J.Z. W.Z. X.Z. and S.Z.; drafted and edited the paper, A.B. M. Boehnke, M. Boezen, M.J.D. G.H.d.B. N.J.D. T.R.G. J.B.H. N.I. N.M.J. M.K. V.L.F. S.M. A.R.M. H.M. S.N. B.M.N. C.M.O. B.P. H.R. C.S. J.A.S. J.W.S. K.T. Y.W. D.C.W. C.J.W. K.-H.H.W. H.Z. J.Z. W.Z. and S.Z.; primary meta-analysis and quality control, M.J.D. H.K.F. M. Kanai, J.K. J.T.K. M. Kurki, M.M. B.M.N. C.J.W. K.-H.H.W. and W.Z.; drug discovery: S.N. T.K. K.-H.H.W. W.Z. and Y.O.; fine mapping, M. Kanai, W.Z. M.J.D. and H.K.F.; polygenic risk score, Y.W. S.N. E.A.L.-M. S.K. K.T. K.L. M. Kanai, W.Z. K.W. M.-J.F. L.B. P.A. P.D. V.L.F. R.M. Y.M. B.B. S.S. J.U. E.R.G. N.J.C. I.S. Y.O. A.R.M. and J.B.H.; proteome-wide Mendelian randomization, H.Z. H.R. A.B. G.H. G.D.S. B.M.B. W.Z. B.M.N. T.R.G. and J.Z.; transcriptome-wide association study, A.B. J.B.H. W.Z. J.Z. M. Kanai, B.P. E.R.G. and N.J.C.; asthma, K.T. W.Z. Y.W. M. Kanai, S.N. Y.O. B.M.N. M.J.D. and A.R.M.; heart failure, K.-H.H.W. N.J.D. B.N.W. I.S. S.E.G. J.B.H. N.J.C. M.P. R.J.F.L. M.J.D. B.M.N. W.Z. W.E.H. and C.J.W.; idiopathic pulmonary fibrosis, J.J.P. W.Z. M.J.D. J.T.K. N.J.C. and J.B.H.; primary open-angle glaucoma, V.L.F. A.B. W.Z. Y.W. K.L. M. Kanai, E.A.L.-M. P.S. R.T. X.Z. S.N. S.S. Y.O. N.I. S.M. H.S. I.S. C.W. A.R.M. E.R.G. N.M.J. N.J.C. and J.B.H.; stroke, I.S. K.-H.H.W. W.H. B.N.W. W.Z. J.E.H. A.P. B.B. A.H.S. M.E.G. R.G.W. K.H. C.K. S.Z. M.J.D. B.M.N. and C.J.W.; venous thromboembolism, B.N.W. I.S. K.-H.H.W. B.B. V.L.F. K.T. M.D. B.N. W.Z. J.A.S. and C.J.W. All authors reviewed the manuscript. M.J.D. is a founder of Maze Therapeutics. B.M.N. is a member of the scientific advisory board at Deep Genomics and a consultant for Camp4 Therapeutics, Takeda Pharmaceutical, and Biogen. The spouse of C.J.W. works at Regeneron Pharmaceuticals. C.-Y.C. is employed by Biogen. C.R.G. owns stock in 23andMe, Inc. T.R.G. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. E.E.K. has received speaker fees from Regeneron, Illumina, and 23andMe and is a member of the advisory board for Galateo Bio. R.E.M. has received speaker fees from Illumina and is a scientific advisor to the Epigenetic Clock Development Foundation. G.D.S. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. K.S. and U.T. are employed by deCODE Genetics/Amgen, Inc. J.Z. has received research funding from various pharmaceutical companies to support the application of Mendelian randomization to drug target prioritization. S.M. is a co-founder of and holds stock in Seonix Bio.
Publisher Copyright:
© 2022
PY - 2022/10/12
Y1 - 2022/10/12
N2 - Biobanks are being established across the world to understand the genetic and environmental basis of human diseases with the goal of better prevention and treatments. Genome-wide association studies (GWAS) have been very successful at mapping genomic loci for a wide range of human diseases and traits, but in general lack appropriate representation of diverse ancestries - with most biobanks and preceding GWAS studies composed of individuals of European ancestry. Here, we introduce the Global Biobank Meta-analysis Initiative (GBMI) -- a collaborative network of 23 biobanks from 4 continents representing more than 2.2 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWAS generated using harmonized genotypes and phenotypes from member biobanks. GBMI brings together results from GWAS analyses across 6 main ancestry groups: approximately 42,000 of African ancestries from admixed-ancestry diaspora (AFR), 18,000 admixed American (AMR), 31,000 Central and South Asian (CSA), 415,000 East Asian (EAS), 1.4 million European (EUR), and 12,000 Middle Eastern (MID) individuals. In this flagship project, we generated GWASs from across 14 exemplar diseases and endpoints, including both common and less prevalent diseases that were previously understudied. Using the genetic association results, we validate that GWASs conducted in biobanks worldwide can be successfully integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics between biobanks. We demonstrate the value of this collaborative effort to improve GWAS power for diseases, increase representation, benefit understudied diseases, and improve risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of human diseases and traits.
AB - Biobanks are being established across the world to understand the genetic and environmental basis of human diseases with the goal of better prevention and treatments. Genome-wide association studies (GWAS) have been very successful at mapping genomic loci for a wide range of human diseases and traits, but in general lack appropriate representation of diverse ancestries - with most biobanks and preceding GWAS studies composed of individuals of European ancestry. Here, we introduce the Global Biobank Meta-analysis Initiative (GBMI) -- a collaborative network of 23 biobanks from 4 continents representing more than 2.2 million consented individuals with genetic data linked to electronic health records. GBMI meta-analyzes summary statistics from GWAS generated using harmonized genotypes and phenotypes from member biobanks. GBMI brings together results from GWAS analyses across 6 main ancestry groups: approximately 42,000 of African ancestries from admixed-ancestry diaspora (AFR), 18,000 admixed American (AMR), 31,000 Central and South Asian (CSA), 415,000 East Asian (EAS), 1.4 million European (EUR), and 12,000 Middle Eastern (MID) individuals. In this flagship project, we generated GWASs from across 14 exemplar diseases and endpoints, including both common and less prevalent diseases that were previously understudied. Using the genetic association results, we validate that GWASs conducted in biobanks worldwide can be successfully integrated despite heterogeneity in case definitions, recruitment strategies, and baseline characteristics between biobanks. We demonstrate the value of this collaborative effort to improve GWAS power for diseases, increase representation, benefit understudied diseases, and improve risk prediction while also enabling the nomination of disease genes and drug candidates by incorporating gene and protein expression data and providing insight into the underlying biology of human diseases and traits.
KW - biobank meta-analysis
KW - genetic association studies
KW - phenotype harmonization
KW - ancestry diversity
U2 - https://doi.org/10.1016/j.xgen.2022.100192
DO - https://doi.org/10.1016/j.xgen.2022.100192
M3 - Article
SN - 2666-979X
JO - Cell Genomics
JF - Cell Genomics
ER -