SGP-SAM: Self-Gated Prompting for Transferring 3D Segment Anything Models to Lesion Segmentation

arXiv cs.CV / 4/28/2026

📰 NewsModels & Research

Key Points

  • The paper introduces SGP-SAM, a self-gated prompting framework aimed at transferring 3D SAM-style segmentation models to medical lesion segmentation, which is hard due to limited spatial capacity and severe 3D foreground–background imbalance.
  • SGP-SAM’s Self-Gated Prompting Module (SGPM) uses a lightweight multi-channel gating unit to decide when to apply multi-scale feature fusion, improving efficiency while enhancing spatial context only when needed.
  • To better learn small lesions, the method proposes a Zoom Loss that re-weights lesion-focused supervision by combining Dice loss with a voxel-balanced focal term.
  • Experiments on MSD Liver Tumor and MSD Brain Tumor (enhancing tumor) show improvements over strong SAM-Med3D-based transfer baselines, including a 7.3% mDice gain on MSD Liver Tumor compared with fine-tuning.
  • The work highlights practical architectural strategies for adapting general 3D segmentation foundation models to challenging medical segmentation tasks involving small, irregular targets.

Abstract

Large segmentation foundation models such as the Segment Anything Model (SAM) have reshaped promptable segmentation in natural images, and recent efforts have extended these models to medical images and volumetric settings. However, directly transferring a 3D SAM-style model to lesion segmentation remains challenging due to (i) weak spatial representational capacity for small, irregular targets in intermediate features, and (ii) extreme foreground-background imbalance in 3D volumes.We propose SGP-SAM, a self-gated prompting framework for efficient and effective transfer to 3D lesion segmentation. Our key component, the Self-Gated Prompting Module (SGPM), performs conditional multi-scale spatial enhancement: a lightweight multi-channel gating unit predicts whether the current features require additional multi-scale fusion, and only then activates a Multi-Scale Feature Fusion Block to enrich spatial context. To further address small-lesion learning, we design a Zoom Loss that up-weights lesion-focused supervision by combining Dice and a voxel-balanced focal term.Experiments on MSD Liver Tumor and MSD Brain Tumor (enhancing tumor) show consistent gains over strong transfer baselines based on SAM-Med3D. On MSD Liver Tumor, SGP-SAM improves mDice by 7.3% over fine-tuning.