TY - JOUR
T1 - Development and validation of DNA Methylation scores in two European cohorts augment 10-year risk prediction of type 2 diabetes
AU - Cheng, Yipeng
AU - Gadd, Danni
AU - Gieger, Christian
AU - Monterrubio-Gomez, Karla
AU - Zhang, Yufei
AU - Berta, Imrich
AU - Stam, Michael
AU - Szlachetka, Natalia
AU - Lobzaev, Evgenii
AU - Wrobel, Nicola
AU - Murphy, Lee
AU - Campbell, Archie
AU - Nangle, Cliff
AU - Walker, Rosie
AU - Fawns-Ritchie, Chloe
AU - Peters, Annette
AU - Rathmann, Wolfgang
AU - Porteous, David John
AU - Evans, Kathryn Louise
AU - McIntosh, Andrew M
AU - Cannings, Timothy I
AU - Waldenberger, Melanie
AU - Ganna, Andrea
AU - McCartney, Daniel L
AU - Vallejos, Catalina A
AU - Marioni, Riccardo E
PY - 2023/4/6
Y1 - 2023/4/6
N2 - Type 2 diabetes mellitus (T2D) presents a major health and economic burden that could be alleviated with improved early prediction and intervention. While standard risk factors have shown good predictive performance, we show that the use of blood-based DNA methylation information leads to a significant improvement in the prediction of 10-year T2D incidence risk. Previous studies have been largely constrained by linear assumptions, the use of cytosine–guanine pairs one-at-a-time and binary outcomes. We present a flexible approach (via an R package, MethylPipeR) based on a range of linear and tree-ensemble models that incorporate time-to-event data for prediction. Using the Generation Scotland cohort (training set ncases = 374, ncontrols = 9,461; test set ncases = 252, ncontrols = 4,526) our best-performing model (area under the receiver operating characteristic curve (AUC) = 0.872, area under the precision-recall curve (PRAUC) = 0.302) showed notable improvement in 10-year onset prediction beyond standard risk factors (AUC = 0.839, precision–recall AUC = 0.227). Replication was observed in the German-based KORA study (n = 1,451, ncases = 142, P = 1.6 × 10−5).
AB - Type 2 diabetes mellitus (T2D) presents a major health and economic burden that could be alleviated with improved early prediction and intervention. While standard risk factors have shown good predictive performance, we show that the use of blood-based DNA methylation information leads to a significant improvement in the prediction of 10-year T2D incidence risk. Previous studies have been largely constrained by linear assumptions, the use of cytosine–guanine pairs one-at-a-time and binary outcomes. We present a flexible approach (via an R package, MethylPipeR) based on a range of linear and tree-ensemble models that incorporate time-to-event data for prediction. Using the Generation Scotland cohort (training set ncases = 374, ncontrols = 9,461; test set ncases = 252, ncontrols = 4,526) our best-performing model (area under the receiver operating characteristic curve (AUC) = 0.872, area under the precision-recall curve (PRAUC) = 0.302) showed notable improvement in 10-year onset prediction beyond standard risk factors (AUC = 0.839, precision–recall AUC = 0.227). Replication was observed in the German-based KORA study (n = 1,451, ncases = 142, P = 1.6 × 10−5).
U2 - 10.1038/s43587-023-00391-4
DO - 10.1038/s43587-023-00391-4
M3 - Article
SN - 2662-8465
VL - 3
SP - 450
EP - 458
JO - Nature Aging
JF - Nature Aging
ER -