In Tomography (Ann Arbor, Mich.)
BACKGROUND AND PURPOSE : Fully automated methods for segmentation and volume quantification of intraparenchymal hemorrhage (ICH), intraventricular hemorrhage extension (IVH), and perihematomal edema (PHE) are gaining increasing interest. However, the reliabilities reported for these methods vary considerably. We therefore aimed to evaluate the intra- and interrater reliability of the manual ICH, IVH, and PHE segmentations that serve as ground-truth masks.
METHODS : Patients with primary spontaneous ICH were retrospectively included from a German tertiary stroke center (Charité Berlin; January 2016 to June 2020). Two radiology residents analyzed baseline and follow-up non-contrast computed tomography (NCCT) scans to quantify ICH, IVH, and PHE volumes. Raters were blinded to all demographic and outcome data. Inter- and intrarater agreement was determined by calculating the intraclass correlation coefficient (ICC) for a randomly selected set of patients with ICH, IVH, and PHE.
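The agreement measure named in the methods can be illustrated with a minimal sketch. The abstract does not state which ICC form was used, so the two-way random-effects, absolute-agreement, single-rater form, ICC(2,1), is an assumption here. The function name `icc_2_1` and the example volumes are hypothetical and are not study data.

```python
import numpy as np


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    `ratings` is an (n_subjects, n_raters) array, e.g. one volume per
    patient and rater. The choice of ICC(2,1) is an assumption; the
    source does not specify the ICC form.
    """
    x = np.asarray(ratings, dtype=float)
    if x.ndim != 2 or x.shape[0] < 2 or x.shape[1] < 2:
        raise ValueError("need a 2-D array with at least 2 subjects and 2 raters")

    n, k = x.shape
    grand_mean = x.mean()
    row_means = x.mean(axis=1)  # per-subject means
    col_means = x.mean(axis=0)  # per-rater means

    # Mean squares from the two-way ANOVA decomposition.
    ms_rows = k * np.sum((row_means - grand_mean) ** 2) / (n - 1)
    ms_cols = n * np.sum((col_means - grand_mean) ** 2) / (k - 1)
    residuals = x - row_means[:, None] - col_means[None, :] + grand_mean
    ms_error = np.sum(residuals ** 2) / ((n - 1) * (k - 1))

    denominator = ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    return float((ms_rows - ms_error) / denominator)


if __name__ == "__main__":
    # Hypothetical volumes in mL (rows: patients, columns: raters).
    volumes = [
        [12.1, 12.4],
        [30.5, 29.8],
        [5.2, 5.6],
        [48.0, 47.1],
        [21.3, 22.0],
    ]
    print(f"ICC(2,1) = {icc_2_1(volumes):.3f}")
```

Because this form measures absolute agreement, a systematic offset between raters lowers the coefficient even when their rankings match. The sketch returns only a point estimate; confidence intervals like those reported in the results would need an additional F-distribution-based step or a statistics package.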
RESULTS : Of 670 patients, 100 were included in the analysis. Interrater agreements ranged from an ICC of 0.998 for ICH (95% CI [0.993; 0.997]), to an ICC of 0.979 for IVH (95% CI [0.984; 0.993]), and an ICC of 0.886 for PHE (95% CI [0.760; 0.938]), all p-values < 0.001. Intrarater agreements ranged from an ICC of 0.997 for ICH (95% CI [0.996; 0.998]), to an ICC of 0.995 for IVH (95% CI [0.992; 0.996]), and an ICC of 0.980 for PHE (95% CI [0.971; 0.987]), all p-values < 0.001.
CONCLUSION : Manual segmentations of ICH, IVH, and PHE demonstrate good-to-excellent inter- and intrarater reliability, with the highest agreement for ICH and IVH and the lowest for PHE. The variance reported for fully automated quantification methods might therefore be related, among other factors, to variance in the ground-truth masks.
Vogt Estelle, Vu Ly Huong, Cao Haoyin, Speth Anna, Desser Dmitriy, Schlunk Frieder, Dell’Orco Andrea, Nawabi Jawed
2023-Jan-11
computed tomography, deep learning, ground-truth, interrater reliability, intracranial hemorrhage, intrarater reliability, intraventricular hemorrhage, perihematomal edema