In Journal of medical Internet research ; h5-index 88.0
BACKGROUND : In December 2019, Coronavirus disease 2019 (COVID-19) outbreak started in China and rapidly spread around the world. Lack of any vaccine or optimized intervention raised the importance of characterizing risk factors and symptoms for the early identification and successful treatment of COVID-19 patients.
OBJECTIVE : This study aims to investigate and analyze biomedical literature and public social media data to understand the association of risk factors and symptoms with various outcomes of COVID-19 patients.
METHODS : Through semantic analysis, we collected 45 retrospective cohort studies, which evaluated 303 clinical and demographic variables across 13 different outcomes of COVID-19 patients, and 84,140 Twitter posts from 1,036 COVID-19 positive users. Machine-learning tools to extract biomedical information were introduced to identify uncommon or novel symptoms mentioning in social media. We then examined and compared two datasets to expand our landscape of risk factors and symptoms related to COVID-19.
RESULTS : From the biomedical literature, approximately 90% of clinical and demographic variables showed inconsistent associations with COVID-19 outcomes. Consensus analysis identified 72 risk factors that were specifically associated with individual outcomes. From the social media data, 51 symptoms were characterized and analyzed. By comparing social media data with biomedical literature, we identified 25 novel symptoms that were specifically mentioned in social media but have been not previously well characterized. Furthermore, there were certain combinations of symptoms that were frequently mentioned together in social media.
CONCLUSIONS : Identified outcome-specific risk factors, symptoms, and combinations of symptoms may serve as surrogate indicators to identify COVID-19 patients and predict their clinical outcomes providing appropriate treatments.
Jeon Jouhyun, Baruah Gaurav, Sarabadani Sarah, Palanica Adam