In Zhonghua wai ke za zhi [Chinese journal of surgery]
Objective: To compare the performance of multiple machine learning algorithms in predicting recurrence after resection of early-stage hepatocellular carcinoma(HCC). Methods: Clinical data of 882 early-stage HCC patients who were admitted to the First Affiliated Hospital of Nanjing Medical University from May 2009 to December 2019 and treated with curative surgical resection were retrospectively collected. There were 701 males and 181 females,with an age of (57.3±10.5)years(range:21 to 86 years). All patients were randomly assigned in a 2∶1 ratio, the training dataset consisted of 588 patients and the test dataset consisted of 294 patients. The construction of machine learning-based prediction models included random survival forest(RSF),gradient boosting machine,elastic net regression and Cox regression model. The prediction accuracy of the model was measured by the concordance index(C-index). The prediction error of the model was measured by the integrated Brier score. Model fit was assessed by the calibration plot. The performance of machine learning models with that of rival model and HCC staging systems was compared. All models were validated in the independent test dataset. Results: Median recurrence-free survival was 61.7 months in the training dataset while median recurrence-free survival was 61.9 months in the validation dataset, there was no significant difference between two datasets in terms of recurrence-free survival(χ²=0.029,P=0.865). The RSF model consisted of 5 commonly used clinicopathological characteristics, including albumin-bilirubin grade,serum alpha fetoprotein,tumor number,type of hepatectomy and microvascular invasion. In both training and test datasets,the RSF model provided the best prediction accuracy,with respective C-index of 0.758(95%CI:0.725 to 0.791) and 0.749(95%CI:0.700 to 0.797),and the lowest prediction error,with respective integrated Brier score of 0.171 and 0.151. The prediction accuracy of RSF model for recurrence after resection of early-stage HCC was superior to that of other machine learning models,rival model(ERASL model) as well as HCC staging systems(BCLC,CNLC and TNM staging),with statistically significant difference(P<0.01). Calibration curves demonstrated good agreement between RSF model-predicted probabilities and observed outcomes.All patients could be stratified into low-risk,intermediate-risk or high-risk group based on RSF model;statistically significant differences among three risk groups were observed in both training and test datasets(P<0.01). The risk stratification of RSF model was superior to that of TNM staging. Conclusion: The proposed RSF model assembled with 5 commonly used clinicopathological characteristics in this study can predict the recurrence risk with favorable accuracy that may facilitate clinical decision-support for patients with early-stage HCC.
Ji G W, Wang K, Xia Y X, Li X C, Wang X H