In Neural processing letters
Training a machine learning model on the data sets with missing labels is a challenging task. Not all models can handle the problem of missing labels. However, if these data sets are further corrupted with label noise, it becomes even more challenging to train a machine learning model on such data sets. We propose to use a transductive support vector machine (TSVM) for semi-supervised learning in this situation. We make this model robust to label noise by using a truncated pinball loss function with it. We name our approach, -TSVM. We provide both the primal and the dual formulations of the obtained robust TSVM for linear and non-linear kernels. We also perform experiments on synthetic and real-world data sets to prove the superior robustness of our model as compared to the existing approaches. To this end, we use small as well as large-scale data sets to perform the experiments. We show that the model is capable of training in the presence of label noise and finding the missing labels of the data samples. We use this property of -TSVM to detect the coronavirus patients based on their chest X-ray images.
Singla Manisha, Ghosh Debdas, Shukla K K
COVID-19, Robust statistics, Semi-supervised learning, Transductive support vector machine, Truncated pinball loss function, VGG-19