On Classifying Sepsis Heterogeneity in the ICU: Insight Using Machine Learning

Zina M. Ibrahim, Honghan Wu, Ahmed Hamoud, Lukas Stappen, Richard J.B. Dobson, Andrea Agarossi

Research output: Contribution to journalArticlepeer-review

Abstract / Description of output

Current machine learning models aiming to predict sepsis from Electronic Health Records (EHR) do not account for the heterogeneity of the condition, despite its emerging importance in prognosis and treatment. This work demonstrates the added value of stratifying the types of organ dysfunction observed in patients who develop sepsis in the ICU in improving the ability to recognise patients at risk of sepsis from their EHR data. Using an ICU dataset of 13,728 records, we identify clinically significant sepsis subpopulations with distinct organ dysfunction patterns. Classification experiments using Random Forest, Gradient Boost Trees and Support Vector Machines, aiming to distinguish patients who develop sepsis in the ICU from those who do not, show that features selected using sepsis subpopulations as background knowledge yield a superior performance regardless of the classification model used. Our findings can steer machine learning efforts towards more personalised models for complex conditions including sepsis.
Original languageEnglish
JournalJAMIA: A Scholarly Journal of Informatics in Health and Biomedicine (JAMIA)
Early online date17 Jan 2020
Publication statusE-pub ahead of print - 17 Jan 2020


Dive into the research topics of 'On Classifying Sepsis Heterogeneity in the ICU: Insight Using Machine Learning'. Together they form a unique fingerprint.

Cite this