OBJECTIVE: Risk prediction models are widely used to inform evidence-based clinical decision making. However, few models developed from single cohorts can perform consistently well at population level where diverse prognoses exist (such as the SARS-CoV2 pandemic). This study aims at tackling this challenge by synergising prediction models from the literature using ensemble learning.
MATERIALS AND METHODS: In this study we selected and reimplemented seven prediction models for COVID-19, which were derived from diverse cohorts and used different implementation techniques. A novel ensemble learning framework was proposed to synergise them for realising personalised predictions for individual patients. Four diverse international cohorts (2 from the UK and 2 from China; total N=5,394) were used to validate all eight models on discrimination, calibration and clinical usefulness.
RESULTS: Results showed that individual prediction models could perform well on some cohorts while poorly on others. Conversely, the ensemble model achieved the best performances consistently on all metrics quantifying discrimination, calibration and clinical usefulness. Performance disparities were observed in cohorts from the two countries: all models achieved better performances on the China cohorts.
DISCUSSION: When individual models were learned from complementary cohorts, the synergised model will have the potential to achieve synergised performances. Results indicate that blood parameters and physiological measurements might have better predictive powers when collected early, which remains to be confirmed by further studies.
CONCLUSIONS: Combining a diverse set of individual prediction models, ensemble method can synergise a robust and well-performing model by choosing the most competent ones for individual patients.
|Journal||Journal of the American Medical Informatics Association|
|Early online date||13 Nov 2020|
|Publication status||E-pub ahead of print - 13 Nov 2020|