In Journal of occupational rehabilitation
PURPOSE : Machine learning (ML) methods have shown higher accuracy than classical methods (e.g., logistic regression models) in identifying individuals without cancer who were unable to return to work (RTW). We therefore aim to discuss the value of these methods in relation to RTW among cancer survivors.
METHODS : Breast cancer (BC) survivors who were working at diagnosis within the CONSTANCES cohort were included in the study. RTW was assessed five years after the BC diagnosis (early retirement was considered as non-RTW). Age and occupation at diagnosis, and physical occupational job exposures assessed using the Job Exposure Matrix, JEM-CONSTANCES, were evaluated as predictors of RTW five years after BC diagnosis. The following four ML methods were used: (i) k-nearest neighbors; (ii) random forest; (iii) neural network; and (iv) elastic net.
RESULTS : The training sample included 683 BC survivors (RTW: 85.7%), and the test sample included 171 (RTW: 85.4%). The elastic net method had the best overall results despite low sensitivity (accuracy = 76.6%; sensitivity = 31.7%; specificity = 90.8%), whereas the random forest model was the most accurate (accuracy = 79.5%) but also the least sensitive (sensitivity = 14.3%).
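As a reading aid for the three metrics reported above, the following minimal sketch shows how accuracy, sensitivity and specificity are usually computed from binary predictions. It is not the authors' code. The function name `binary_metrics`, the toy labels, and the choice of non-RTW as the positive class are all assumptions made for illustration; the abstract does not state which class was treated as positive.

```python
def binary_metrics(y_true, y_pred, positive=1):
    """Return accuracy, sensitivity and specificity for binary labels."""
    pairs = list(zip(y_true, y_pred))
    # Count the four cells of the confusion matrix.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return {
        "accuracy": (tp + tn) / len(pairs),
        # Share of true positives that were detected.
        "sensitivity": tp / (tp + fn) if (tp + fn) else float("nan"),
        # Share of true negatives that were correctly left unflagged.
        "specificity": tn / (tn + fp) if (tn + fp) else float("nan"),
    }


# Hypothetical toy labels (NOT the study data).
# Assumption: 1 = non-RTW (positive class), 0 = RTW.
y_true = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
y_pred = [1, 0, 0, 0, 0, 0, 0, 0, 0, 1]

metrics = binary_metrics(y_true, y_pred)

# A trivial rule that predicts RTW for everyone, for comparison.
baseline = binary_metrics(y_true, [0] * len(y_true))

print(metrics)
print(baseline)
```

On these toy labels the trivial rule reaches the same accuracy as the toy predictions while detecting none of the positive cases. This is one reason accuracy is usually read together with sensitivity and specificity when one outcome is much more frequent than the other.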
CONCLUSION : This study takes a first step towards opening up new possibilities for identifying the occupational determinants of cancer survivors' RTW. Further work, with a larger sample size and more predictor variables, is now needed.
Badreau Marie, Fadel Marc, Roquelaure Yves, Bertin Mélanie, Rapicault Clémence, Gilbert Fabien, Porro Bertrand, Descatha Alexis
2023-Mar-20
Breast cancer, Machine learning, Methods, Prediction, Return to work, Survivors