In BMC infectious diseases ; h5-index 58.0
BACKGROUND : Mexico ranks fifth worldwide in the number of deaths due to COVID-19. Identifying risk markers through easily accessible clinical data could help in the initial triage of COVID-19 patients and anticipate a fatal outcome, especially in the most socioeconomically disadvantaged regions. This study aims to identify markers that increase lethality risk in patients diagnosed with COVID-19, based on machine learning (ML) methods. Markers were differentiated by sex and age-group.
METHODS : A total of 11,564 cases of COVID-19 in Mexico were extracted from the Epidemiological Surveillance System for Viral Respiratory Disease. Four ML classification methods were trained to predict lethality, and an interpretability approach was used to identify those markers.
RESULTS : Models based on Extreme Gradient Boosting (XGBoost) yielded the best performance in a test set. This model achieved a sensitivity of 0.91, a specificity of 0.69, a positive predictive value of 0.344, and a negative predictive value of 0.965. For female patients, the leading markers are diabetes and arthralgia. For males, the main markers are chronic kidney disease (CKD) and chest pain. Dyspnea, hypertension, and polypnea increased the risk of death in both sexes.
CONCLUSIONS : ML-based models using an interpretability approach successfully identified risk markers for lethality by sex and age. Our results indicate that age is the strongest demographic factor for a fatal outcome, while all other markers were consistent with previous clinical trials conducted in a Mexican population. The markers identified here could be used as an initial triage, especially in geographic areas with limited resources.
Rojas-García Mariano, Vázquez Blanca, Torres-Poveda Kirvis, Madrid-Marina Vicente
2023-Jan-11
COVID-19, Lethality risk markers, Machine learning, Mexico