In Pathology, research and practice
The SARS-CoV-2 pandemic is a current global threat that has caused an enormous number of deaths. As most countries face resource constraints, particularly for intensive care and oxygen, highly accurate severity prediction is crucial. Such prediction will help the medical community select the patients who need these constrained resources. The literature shows that using clinical data is the common trend in such studies, and molecular data are rarely utilized for this prediction. As molecular data carry more disease-related information, in this study three different types of RNA molecules (lncRNA, miRNA and mRNA) from SARS-CoV-2 patients are used to predict the severity stage and treatment stage of those patients. Experiments with seven different machine learning algorithms and several feature selection techniques show that, for both phenotypes, features selected by feature importance, combined with a random forest classifier, provide the best accuracy. Furthermore, miRNA and lncRNA data give the best performance in severity stage prediction, and lncRNA data give the best performance in treatment stage prediction. As most studies of molecular data use mRNA data, this is an interesting finding.
COVID-19 molecular data, Classification algorithm, Feature selection, Severity prediction, Treatment stage, lncRNA, miRNA and mRNA