CardioSAM: Topology-Aware Decoder Design for High-Precision Cardiac MRI Segmentation

arXiv cs.CV / 4/7/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • CardioSAM is proposed as a topology-aware hybrid segmentation model for cardiac MRI, aiming to deliver clinically needed boundary precision beyond what general foundation-model segmenters typically provide.
  • The method freezes a SAM encoder for robust general feature extraction and adds a lightweight, trainable cardiac-specific decoder with (1) a Cardiac-Specific Attention module using anatomical topological priors and (2) a Boundary Refinement Module to sharpen tissue interfaces.
  • On the ACDC benchmark, CardioSAM reports Dice 93.39%, IoU 87.61%, pixel accuracy 99.20%, and HD95 4.2 mm, outperforming strong baselines including nnU-Net by +3.89% Dice.
  • The authors claim the results exceed inter-expert agreement levels (91.2%), suggesting the approach could reduce variability and improve reliability for clinical cardiac structure segmentation.

Abstract

Accurate segmentation of cardiac structures in cardiovascular magnetic resonance (CMR) images is essential for reliable diagnosis and treatment of cardiovascular diseases. However, manual segmentation remains time-consuming and suffers from significant inter-observer variability. Recent advances in deep learning, particularly foundation models such as the Segment Anything Model (SAM), demonstrate strong generalization but often lack the boundary precision required for clinical applications. To address this limitation, we propose CardioSAM, a hybrid architecture that combines the generalized feature extraction capability of a frozen SAM encoder with a lightweight, trainable cardiac-specific decoder. The proposed decoder introduces two key innovations: a Cardiac-Specific Attention module that incorporates anatomical topological priors, and a Boundary Refinement Module designed to improve tissue interface delineation. Experimental evaluation on the ACDC benchmark demonstrates that CardioSAM achieves a Dice coefficient of 93.39%, IoU of 87.61%, pixel accuracy of 99.20%, and HD95 of 4.2 mm. The proposed method surpasses strong baselines such as nnU-Net by +3.89% Dice and exceeds reported inter-expert agreement levels (91.2%), indicating its potential for reliable and clinically applicable cardiac segmentation.