TY - JOUR
T1 - Physician-Confirmed and Administrative Definitions of Stroke in UK Biobank Reflect the Same Underlying Genetic Trait
AU - Rannikmäe, Kristiina
AU - Rawlik, Konrad
AU - Ferguson, Amy C
AU - Avramidis, Nikos
AU - Jiang, Muchen
AU - Pirastu, Nicola
AU - Shen, Xia
AU - Davidson, Emma
AU - Woodfield, Rebecca
AU - Malik, Rainer
AU - Dichgans, Martin
AU - Tenesa, Albert
AU - Sudlow, Cathie
N1 - Copyright © 2022 Rannikmäe, Rawlik, Ferguson, Avramidis, Jiang, Pirastu, Shen, Davidson, Woodfield, Malik, Dichgans, Tenesa and Sudlow.
PY - 2022/2/2
Y1 - 2022/2/2
N2 - Background: Stroke in UK Biobank (UKB) is ascertained via linkages to coded administrative datasets and self-report. We studied the accuracy of these codes using genetic validation.Methods: We compiled stroke-specific and broad cerebrovascular disease (CVD) code lists (Read V2/V3, ICD-9/-10) for medical settings (hospital, death record, primary care) and self-report. Among 408,210 UKB participants, we identified all with a relevant code, creating 12 stroke definitions based on the code type and source. We performed genome-wide association studies (GWASs) for each definition, comparing summary results against the largest published stroke GWAS (MEGASTROKE), assessing genetic correlations, and replicating 32 stroke-associated loci.Results: The stroke case numbers identified varied widely from 3,976 (primary care stroke-specific codes) to 19,449 (all codes, all sources). All 12 UKB stroke definitions were significantly correlated with the MEGASTROKE summary GWAS results (rg.81-1) and each other (rg.4-1). However, Bonferroni-corrected confidence intervals were wide, suggesting limited precision of some results. Six previously reported stroke-associated loci were replicated using ≥1 UKB stroke definition.Conclusions: Stroke case numbers in UKB depend on the code source and type used, with a 5-fold difference in the maximum case-sample size. All stroke definitions are significantly genetically correlated with the largest stroke GWAS to date.
AB - Background: Stroke in UK Biobank (UKB) is ascertained via linkages to coded administrative datasets and self-report. We studied the accuracy of these codes using genetic validation.Methods: We compiled stroke-specific and broad cerebrovascular disease (CVD) code lists (Read V2/V3, ICD-9/-10) for medical settings (hospital, death record, primary care) and self-report. Among 408,210 UKB participants, we identified all with a relevant code, creating 12 stroke definitions based on the code type and source. We performed genome-wide association studies (GWASs) for each definition, comparing summary results against the largest published stroke GWAS (MEGASTROKE), assessing genetic correlations, and replicating 32 stroke-associated loci.Results: The stroke case numbers identified varied widely from 3,976 (primary care stroke-specific codes) to 19,449 (all codes, all sources). All 12 UKB stroke definitions were significantly correlated with the MEGASTROKE summary GWAS results (rg.81-1) and each other (rg.4-1). However, Bonferroni-corrected confidence intervals were wide, suggesting limited precision of some results. Six previously reported stroke-associated loci were replicated using ≥1 UKB stroke definition.Conclusions: Stroke case numbers in UKB depend on the code source and type used, with a 5-fold difference in the maximum case-sample size. All stroke definitions are significantly genetically correlated with the largest stroke GWAS to date.
KW - accuracy
KW - genetic correlation
KW - routinely collected health data
KW - stroke
KW - validation
U2 - 10.3389/fneur.2021.787107
DO - 10.3389/fneur.2021.787107
M3 - Article
C2 - 35185750
VL - 12
JO - Frontiers in Neurology
JF - Frontiers in Neurology
SN - 1664-2295
M1 - 787107
ER -