In Journal of clinical ultrasound : JCU
BACKGROUND : In recent years, artificial intelligence (AI) algorithms have been used to accurately diagnose musculoskeletal diseases. However, it is not known whether the particular region of interest (ROI) delineation method affects the performance of the AI algorithm.
PURPOSE : The purpose of this study was to investigate the influence of ROI delineation methods on model performance and observer consistency.
METHODS : In this retrospective analysis, ultrasound (US) measures of median nerves affected by carpal tunnel syndrome (CTS) were compared with those of median nerves in a control group without CTS. Two methods were used to delineate the ROI: (1) the ROI drawn along the hyperechoic medial edge of the median nerve (MN) but not including the epineurium (ROI1); and (2) the ROI including the hyperechoic epineurium (ROI2). The intraclass correlation coefficient (ICC) was used to compare the observer consistency of ROI features (i.e., the corresponding radiomics parameters). Parameters α1 and α2 were obtained from the ICC of the ROI1 features and the ROI2 features, respectively. Receiver operating characteristic (ROC) analysis was used to determine the area under the curve (AUC) and to evaluate the performance of the radiologists and the network. In addition, four indices, namely sensitivity, specificity, positive predictive value and negative predictive value, were also analyzed.
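As an illustration of the evaluation metrics named above, the following is a minimal Python sketch of how an AUC and the four indices could be computed from binary labels and model scores. It is not the authors' implementation: the function names, the 0.5 decision threshold, and the example labels and scores are all hypothetical, and the AUC is computed with the standard pairwise (Mann-Whitney) formulation.

```python
import math
from typing import Dict, Sequence


def auc_from_scores(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties count as half a win."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def _ratio(numerator: int, denominator: int) -> float:
    """Return numerator / denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def diagnostic_indices(labels: Sequence[int], scores: Sequence[float],
                       threshold: float = 0.5) -> Dict[str, float]:
    """Sensitivity, specificity, PPV and NPV at a given score threshold.
    The default threshold of 0.5 is an assumption made for this sketch."""
    tp = fp = tn = fn = 0
    for y, s in zip(labels, scores):
        predicted = 1 if s >= threshold else 0
        if predicted == 1 and y == 1:
            tp += 1
        elif predicted == 1 and y == 0:
            fp += 1
        elif predicted == 0 and y == 0:
            tn += 1
        else:
            fn += 1
    return {
        "sensitivity": _ratio(tp, tp + fn),
        "specificity": _ratio(tn, tn + fp),
        "ppv": _ratio(tp, tp + fp),
        "npv": _ratio(tn, tn + fn),
    }


# Hypothetical toy data: 1 = CTS wrist, 0 = control wrist.
example_labels = [1, 1, 1, 1, 0, 0, 0, 0]
example_scores = [0.9, 0.8, 0.6, 0.4, 0.7, 0.3, 0.2, 0.1]

example_auc = auc_from_scores(example_labels, example_scores)
example_indices = diagnostic_indices(example_labels, example_scores)
```

The pairwise AUC is quadratic in the number of cases, which is acceptable for a dataset of a few hundred wrists; a rank-based computation would be preferable for larger samples.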
RESULTS : A total of 136 wrists from 77 patients in the CTS group and 136 wrists from 74 participants in the control group were included in the study. The control group was matched to the CTS group by age and sex. The observer consistency of the ROI features differed between the two delineation schemes, and the consistency of the ROI1 features was higher (α1 > α2). The intra-observer consistency was higher than the inter-observer consistency regardless of the scheme, and the intra-observer consistency was higher when scheme one was used. The performance of the models based on the two sets of ROI features differed, although the AUC of each model was greater than 0.8. The models performed better when the MN epineurium was included in the ROI. Among the five artificial intelligence algorithms, the Forest models obtained the highest performance (model1 achieved an AUC of 0.921 in the training datasets and 0.830 in the testing datasets; model2 achieved an AUC of 0.967 in the training datasets and 0.872 in the testing datasets), followed by the support vector machine (SVM) models and the Logistic models. The performance of the models was significantly better than that of the inexperienced radiologist (Dr. B. Z. achieved an AUC of 0.702).
CONCLUSION : Different ROI delineation methods may affect the performance of the model and the consistency of observers. Model performance was better when the ROI contained the MN epineurium, and observer consistency was higher when the ROI was delineated along the hyperechoic medial border of the MN.
Lyu Shuyi, Zhang Yan, Zhang Meiwu, Jiang Maoqing, Yu Jianjun, Zhu Jiazhen, Zhang Baisong
2022-Nov-07
carpal tunnel syndrome, radiomics, regions of interest, ultrasound