ASGNet: Adaptive Spectrum Guidance Network for Automatic Polyp Segmentation

arXiv cs.CV / 4/17/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces ASGNet, an adaptive spectrum guidance network designed to improve automatic polyp segmentation in colonoscopy images by addressing shortcomings in current models’ spatial-only perception.
  • ASGNet combines spectral features with global attributes using a spectrum-guided non-local perception module to better capture complete polyp structures and refine boundaries.
  • It adds a multi-source semantic extractor to leverage high-level semantic information for more reliable preliminary localization of polyps.
  • A dense cross-layer interaction decoder is used to integrate and strengthen representations across multiple network layers for more accurate final segmentation.
  • Experiments on five common benchmark datasets show ASGNet outperforms 21 state-of-the-art polyp segmentation methods, with code planned for public release on GitHub.

Abstract

Early identification and removal of polyps can reduce the risk of developing colorectal cancer. However, the diverse morphologies, complex backgrounds and often concealed nature of polyps make polyp segmentation in colonoscopy images highly challenging. Despite the promising performance of existing deep learning-based polyp segmentation methods, their perceptual capabilities remain biased toward local regions, mainly because of the strong spatial correlations between neighboring pixels in the spatial domain. This limitation makes it difficult to capture the complete polyp structures, ultimately leading to sub-optimal segmentation results. In this paper, we propose a novel adaptive spectrum guidance network, called ASGNet, which addresses the limitations of spatial perception by integrating spectral features with global attributes. Specifically, we first design a spectrum-guided non-local perception module that jointly aggregates local and global information, therefore enhancing the discriminability of polyp structures, and refining their boundaries. Moreover, we introduce a multi-source semantic extractor that integrates rich high-level semantic information to assist in the preliminary localization of polyps. Furthermore, we construct a dense cross-layer interaction decoder that effectively integrates diverse information from different layers and strengthens it to generate high-quality representations for accurate polyp segmentation. Extensive quantitative and qualitative results demonstrate the superiority of our ASGNet approach over 21 state-of-the-art methods across five widely-used polyp segmentation benchmarks. The code will be publicly available at: https://github.com/CSYSI/ASGNet.