In European radiology ; h5-index 62.0
OBJECTIVES : Diagnosis of flatfoot using a radiograph is subject to intra- and inter-observer variabilities. Here, we developed a cascade convolutional neural network (CNN)-based deep learning model (DLM) for an automated angle measurement for flatfoot diagnosis using landmark detection.
METHODS : We used 1200 weight-bearing lateral foot radiographs from young adult Korean males for the model development. An experienced orthopedic surgeon identified 22 radiographic landmarks and measured three angles for flatfoot diagnosis that served as the ground truth (GT). Another orthopedic surgeon (OS) and a general physician (GP) independently identified the landmarks of the test dataset and measured the angles using the same method. External validation was performed using 100 and 17 radiographs acquired from a tertiary referral center and a public database, respectively.
RESULTS : The DLM showed smaller absolute average errors from the GT for the three angle measurements for flatfoot diagnosis compared with both human observers. Under the guidance of the DLM, the average errors of observers OS and GP decreased from 2.35° ± 3.01° to 1.55° ± 2.09° and from 1.99° ± 2.76° to 1.56° ± 2.19°, respectively (both p < 0.001). The total measurement time decreased from 195 to 135 min in observer OS and from 205 to 155 min in observer GP. The absolute average errors of the DLM in the external validation sets were similar or superior to those of human observers in the original test dataset.
CONCLUSIONS : Our CNN model had significantly better accuracy and reliability than human observers in diagnosing flatfoot, and notably improved the accuracy and reliability of human observers.
KEY POINTS : • Development of deep learning model (DLM) that allows automated angle measurements for landmark detection based on 1200 weight-bearing lateral radiographs for diagnosing flatfoot. • Our DLM showed smaller absolute average errors for flatfoot diagnosis compared with two human observers. • Under the guidance of the model, the average errors of two human observers decreased and total measurement time also decreased from 195 to 135 min and from 205 to 155 min.
Ryu Seung Min, Shin Keewon, Shin Soo Wung, Lee Sun Ho, Seo Su Min, Cheon Seung-Uk, Ryu Seung-Ah, Kim Min-Ju, Kim Hyunjung, Doh Chang Hyun, Choi Young Rak, Kim Namkug
2023-Mar-01
Computer-assisted diagnosis, Deep Learning, Flatfoot, Observer variation, X-rays