In Lasers in medical science
The aim of the study is to evaluate the efficacy of the combination of Raman spectroscopy with feature engineering and machine learning algorithms for detecting glioma patients. In this study, we used Raman spectroscopy technology to collect serum spectra of glioma patients and healthy people and used feature engineering-based classification models for prediction. First, to reduce the dimensionality of the data, we used two feature extraction algorithms which are partial least squares (PLS) and principal component analysis (PCA). Then, the principal components were selected using the feature selection methods of four correlation indexes, namely, Relief-F (RF), the Pearson correlation coefficient (PCC), the F-score (FS) and term variance (TV). Finally, back-propagation neural network (BP), linear discriminant analysis (LDA) and support vector machine (SVM) classification models were established. To improve the reliability of the model, we used a fivefold cross validation to measure the prediction performance between different models. In this experiment, 33 classification models were established. Integrating 4 classification criteria, PLS-Relief-F-BP, PLS-F-Score-BP, PLS-LDA and PLS-Relief-F-SVM had better effects, and their accuracy rates reached 97.58%, 96.33%, 97.87% and 96.19%, respectively. The experimental results show that feature engineering can select more representative features, reduce computational time complexity and simplify the model. The classification model established in this experiment can not only increase the robustness of the model and shorten the discrimination time but also realize the rapid, stable and accurate diagnosis of glioma patients, which has high clinical application value.
Ma Mingrui, Tian Xuecong, Chen Fangfang, Ma Xiaojian, Guo Wenjia, Lv Xiaoyi
Classification, Feature engineering, Glioma, Raman spectroscopy