Adaptive Dual Residual U-Net with Attention Gate and Multiscale Spatial Attention Mechanisms (ADRUwAMS)

arXiv cs.CV / 4/13/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces ADRUwAMS, an Adaptive Dual Residual U-Net that combines dual adaptive residual networks, attention gates, and multiscale spatial attention to improve glioma tumor segmentation from brain images.
  • Its dual-branch design is intended to jointly capture high-level semantic cues and detailed low-level features, helping distinguish tumor regions by type, location, and challenging appearance.
  • Attention gates compute attention coefficients using both gating signals and input features to emphasize relevant activations during segmentation.
  • Multiscale spatial attention produces scaled attention maps and fuses them to retain the most informative tumor-related spatial information.
  • Trained for 200 epochs with ReLU on BraTS 2019 and BraTS 2020, the model reports strong Dice scores on BraTS 2020: 0.9229 (whole tumor), 0.8432 (tumor core), and 0.8004 (enhancing tumor).

Abstract

Glioma is a harmful brain tumor that requires early detection to ensure better health results. Early detection of this tumor is key for effective treatment and requires an automated segmentation process. However, it is a challenging task to find tumors due to tumor characteristics like location and size. A reliable method to accurately separate tumor zones from healthy tissues is deep learning models, which have shown promising results over the last few years. In this research, an Adaptive Dual Residual U-Net with Attention Gate and Multiscale Spatial Attention Mechanisms (ADRUwAMS) is introduced. This model is an innovative combination of adaptive dual residual networks, attention mechanisms, and multiscale spatial attention. The dual adaptive residual network architecture captures high-level semantic and intricate low-level details from brain images, ensuring precise segmentation of different tumor parts, types, and hard regions. The attention gates use gating and input signals to compute attention coefficients for the input features, and multiscale spatial attention generates scaled attention maps and combines these features to hold the most significant information about the brain tumor. We trained the model for 200 epochs using the ReLU activation function on BraTS 2020 and BraTS 2019 datasets. These improvements resulted in high accuracy for tumor detection and segmentation on BraTS 2020, achieving dice scores of 0.9229 for the whole tumor, 0.8432 for the tumor core, and 0.8004 for the enhancing tumor.