Receive a weekly summary and discussion of the top papers of the week by leading researchers in the field.

In Medical physics ; h5-index 59.0

BACKGROUND : U-Net and its variations have achieved remarkable performances in medical image segmentation. However, they have two limitations. First, the shallow layer feature of the encoder always contains background noise. Second, semantic gaps exist between the features of the encoder and the decoder. Skip-connections directly connect the encoder to the decoder, which will lead to the fusion of semantically dissimilar feature maps.

PURPOSE : To overcome these two limitations, this paper proposes a novel medical image segmentation algorithm, called feature-guided attention network, which consists of U-Net, the cross-level attention filtering module (CAFM), and the attention-guided upsampling module (AUM).

METHODS : In the proposed method, the AUM and the CAFM were introduced into the U-Net, where the AUM learns to filter the background noise in the low-level feature map of the encoder and the CAFM tries to eliminate the semantic gap between the encoder and the decoder. Specifically, the AUM adopts a top-down pathway to use the high-level feature map so as to filter the background noise in the low-level feature map of the encoder. The AUM uses the encoder features to guide the upsampling of the corresponding decoder features, thus eliminating the semantic gap between them. Four medical image segmentation tasks, including coronary atherosclerotic plaque segmentation (Dataset A), retinal vessel segmentation (Dataset B), skin lesion segmentation (Dataset C), and multi-class retinal edema lesions segmentation (Dataset D), were used to validate the proposed method.

RESULTS : For Dataset A, the proposed method achieved higher Intersection over Union (IoU) (67.91±3.82%), dice (79.39±3.37%), accuracy (98.39±0.34%), and sensitivity (85.10±3.74%) than the previous best method: CA-Net. For Dataset B, the proposed method achieved higher sensitivity (83.50%) and accuracy (97.55%) than the previous best method: SCS-Net. For Dataset C, the proposed method had highest IoU (83.47±0.41%) and dice (90.81±0.34%) than those of all compared previous methods. For Dataset D, the proposed method had highest dice (average: 81.53%; REA: 83.78%; PED 77.13%), sensitivity (REA: 89.01%; SRF: 85.50%), specificity (REA: 99.35%; PED: 100.00), and accuracy (98.73%) among all compared previous networks. In addition, the number of parameters of the proposed method was 2.43 M, which is less than CA-Net (3.21 M) and CPF-Net (3.07 M).

CONCLUSIONS : The proposed method demonstrated state-of-the-art performance, outperforming other top-notch medical image segmentation algoirthms. The CAFM filtered the background noise in the low-level feature map of the encoder, while the AUM eliminated the semantic gap between the encoder and the decoder. Furthermore, the proposed method was of high computational efficiency. This article is protected by copyright. All rights reserved.

Zhou Hao, Sun Chaoyu, Huang Hai, Fan Mingyu, Yang Xu, Zhou Linxiao

2023-Feb-06

attention-guided upsampling module, cross-level attention filtering module, medical image segmentation