In Diabetology & metabolic syndrome
BACKGROUND : Experiencing a hyperglycaemic crisis is associated with a short- and long-term increased risk of mortality. We aimed to develop an explainable machine learning model for predicting 3-year mortality and providing individualized risk factor assessment of patients with hyperglycaemic crisis after admission.
METHODS : Based on five representative machine learning algorithms, we trained prediction models on data from patients with hyperglycaemic crisis admitted to two tertiary hospitals between 2016 and 2020. The models were internally validated by tenfold cross-validation and externally validated using previously unseen data from two other tertiary hospitals. A SHapley Additive exPlanations algorithm was used to interpret the predictions of the best performing model, and the relative importance of the features in the model was compared with the traditional statistical test results.
RESULTS : A total of 337 patients with hyperglycaemic crisis were enrolled in the study, 3-year mortality was 13.6% (46 patients). 257 patients were used to train the models, and 80 patients were used for model validation. The Light Gradient Boosting Machine model performed best across testing cohorts (area under the ROC curve 0.89 [95% CI 0.77-0.97]). Advanced age, higher blood glucose and blood urea nitrogen were the three most important predictors for increased mortality.
CONCLUSION : The developed explainable model can provide estimates of the mortality and visual contribution of the features to the prediction for an individual patient with hyperglycaemic crisis. Advanced age, metabolic disorders, and impaired renal and cardiac function were important factors that predicted non-survival.
TRIAL REGISTRATION NUMBER : ChiCTR1800015981, 2018/05/04.
Xie Puguang, Yang Cheng, Yang Gangyi, Jiang Youzhao, He Min, Jiang Xiaoyan, Chen Yan, Deng Liling, Wang Min, Armstrong David G, Ma Yu, Deng Wuquan
2023-Mar-11
Explainable model, Hyperglycaemic crisis, Machine learning, Mortality