TY - JOUR
T1 - Assessment of genotyping array performance for genome-wide association studies and imputation in African cattle
AU - Riggio, Valentina
AU - Tijjani, Abdulfatai
AU - Callaby, Rebecca
AU - Talenti, Andrea
AU - Wragg, David
AU - Obishakin, Emmanuel T.
AU - Ezeasor, Chukwunonso
AU - Jongejan, Frans
AU - Ogo, Ndudim I.
AU - Aboagye-Antwi, Fred
AU - Toure, Alassane
AU - Nzalawahe, Jahashi
AU - Diallo, Boubacar
AU - Missohou, Ayao
AU - Belem, Adrien M.G.
AU - Djikeng, Appolinaire
AU - Juleff, Nick
AU - Fourie, Josephus
AU - Labuschagne, Michel
AU - Madder, Maxime
AU - Marshall, Karen
AU - Prendergast, James
AU - Morrison, Liam
N1 - Funding Information:
This research was funded in part by the Bill & Melinda Gates Foundation (BMGF) and with UK aid from the UK Foreign, Commonwealth and Development Office (Grant Agreement OPP1127286) under the auspices of the Centre for Tropical Livestock Genetics and Health (CTLGH), established jointly by the University of Edinburgh, SRUC (Scotland’s Rural College), and the International Livestock Research Institute. The survey was partially funded by the Bill & Melinda Gates Foundation (BMGF) (Grant OPP1125367). This research was also in part funded by the United Kingdom Biotechnology and Biological Sciences Research Council through Roslin Institute Strategic Programme Grant funding (BBS/E/D/30002275, BBS/E/D/10002070, BBS/E/D/20002172). For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.
Funding Information:
This research was funded in part by the Bill & Melinda Gates Foundation and with UK aid from the UK Foreign, Commonwealth and Development Office (Grant Agreement OPP1127286) under the auspices of the Centre for Tropical Livestock Genetics and Health (CTLGH), established jointly by the University of Edinburgh, SRUC (Scotland’s Rural College), and the International Livestock Research Institute. The findings and conclusions contained within are those of the authors and do not necessarily reflect positions or policies of the Bill & Melinda Gates Foundation nor the UK Government. Under the grant conditions of the Foundation, a Creative Commons Attribution 4.0 Generic License has already been assigned to the Author Accepted Manuscript version that might arise from this submission. This work was also conducted as part of the ILRI-led CGIAR Research Program on Livestock, which is supported by contributors to the CGIAR Trust Fund: https://www.cgiar.org/funders/ . The authors would also wish to acknowledge Dr Tadelle Dessie of ILRI Ethiopian, for his support in obtaining an ABS permit for the use of Ethiopian Boran cattle sequences.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/9/4
Y1 - 2022/9/4
N2 - BACKGROUND: In cattle, genome-wide association studies (GWAS) have largely focused on European or Asian breeds, using genotyping arrays that were primarily designed for European cattle. Because there is growing interest in performing GWAS in African breeds, we have assessed the performance of 23 commercial bovine genotyping arrays for capturing the diversity across African breeds and performing imputation. We used 409 whole-genome sequences (WGS) spanning global cattle breeds, and a real cohort of 2481 individuals (including African breeds) that were genotyped with the Illumina high-density (HD) array and the GeneSeek bovine 50 k array.RESULTS: We found that commercially available arrays were not effective in capturing variants that segregate among African indicine animals. Only 6% of these variants in high linkage disequilibrium (LD) (r
2 > 0.8) were on the best performing arrays, which contrasts with the 17% and 25% in African and European taurine cattle, respectively. However, imputation from available HD arrays can successfully capture most variants (accuracies up to 0.93), mainly when using a global, not continent-specific, reference panel, which partially reflects the unusually high levels of admixture on the continent. When considering functional variants, the GGPF250 array performed best for tagging WGS variants and imputation. Finally, we show that imputation from low-density arrays can perform almost as well as HD arrays, if a two-stage imputation approach is adopted, i.e. first imputing to HD and then to WGS, which can potentially reduce the costs of GWAS.
CONCLUSIONS: Our results show that the choice of an array should be based on a balance between the objective of the study and the breed/population considered, with the HD and BOS1 arrays being the best choice for both taurine and indicine breeds when performing GWAS, and the GGPF250 being preferable for fine-mapping studies. Moreover, our results suggest that there is no advantage to using the indicus-specific arrays for indicus breeds, regardless of the objective. Finally, we show that using a reference panel that better represents global bovine diversity improves imputation accuracy, particularly for non-European taurine populations.
AB - BACKGROUND: In cattle, genome-wide association studies (GWAS) have largely focused on European or Asian breeds, using genotyping arrays that were primarily designed for European cattle. Because there is growing interest in performing GWAS in African breeds, we have assessed the performance of 23 commercial bovine genotyping arrays for capturing the diversity across African breeds and performing imputation. We used 409 whole-genome sequences (WGS) spanning global cattle breeds, and a real cohort of 2481 individuals (including African breeds) that were genotyped with the Illumina high-density (HD) array and the GeneSeek bovine 50 k array.RESULTS: We found that commercially available arrays were not effective in capturing variants that segregate among African indicine animals. Only 6% of these variants in high linkage disequilibrium (LD) (r
2 > 0.8) were on the best performing arrays, which contrasts with the 17% and 25% in African and European taurine cattle, respectively. However, imputation from available HD arrays can successfully capture most variants (accuracies up to 0.93), mainly when using a global, not continent-specific, reference panel, which partially reflects the unusually high levels of admixture on the continent. When considering functional variants, the GGPF250 array performed best for tagging WGS variants and imputation. Finally, we show that imputation from low-density arrays can perform almost as well as HD arrays, if a two-stage imputation approach is adopted, i.e. first imputing to HD and then to WGS, which can potentially reduce the costs of GWAS.
CONCLUSIONS: Our results show that the choice of an array should be based on a balance between the objective of the study and the breed/population considered, with the HD and BOS1 arrays being the best choice for both taurine and indicine breeds when performing GWAS, and the GGPF250 being preferable for fine-mapping studies. Moreover, our results suggest that there is no advantage to using the indicus-specific arrays for indicus breeds, regardless of the objective. Finally, we show that using a reference panel that better represents global bovine diversity improves imputation accuracy, particularly for non-European taurine populations.
KW - Animals
KW - Cattle/genetics
KW - Genome-Wide Association Study
KW - Genotype
KW - Humans
KW - Linkage Disequilibrium
KW - Polymorphism, Single Nucleotide
KW - Taurine
U2 - 10.1186/s12711-022-00751-5
DO - 10.1186/s12711-022-00751-5
M3 - Article
C2 - 36057548
SN - 0999-193X
VL - 54
SP - 1
EP - 23
JO - Genetics Selection Evolution
JF - Genetics Selection Evolution
IS - 1
M1 - 54:58
ER -