TopoMamba: Topology-Aware Scanning and Fusion for Segmenting Heterogeneous Medical Visual Media

arXiv cs.CV / 4/29/2026

📰 NewsDeveloper Stack & InfrastructureIndustry & Market MovesModels & Research

Key Points

  • TopoMamba is a topology-aware scan-and-fusion framework designed to improve medical image segmentation with visual state-space models by addressing axis-biased scan ordering and inefficient multi-branch fusion.
  • The approach uses complementary TopoA-Scan branches (diagonal/anti-diagonal) alongside a standard Cross-Scan branch to better capture oblique and curved anatomical structures.
  • It introduces ScanCache, a device-aware caching mechanism that reduces the overhead of constructing scan indices across repeated resolutions.
  • For efficient fusion of heterogeneous scan features, TopoMamba proposes a lightweight HSIC Gate that applies dependence-aware scalar gating to regulate branch interactions.
  • Experiments on Synapse CT, ISIC 2017 dermoscopy, and CVC-ClinicDB endoscopy—and a volumetric TopoMamba-3D variant—show consistent improvements over CNN, Transformer, and SSM baselines, especially for thin or curved targets, while keeping deployment efficiency under dynamic input resolutions.

Abstract

Visual state-space models (SSMs) have shown strong potential for medical image segmentation, yet their effectiveness is often limited by two practical issues: axis-biased scan ordering weakens the modeling of oblique and curved structures, and naive multi-branch fusion tends to amplify redundant responses. We present TopoMamba, a topology-aware scan-and-fuse framework for segmenting heterogeneous medical visual media. The method combines a diagonal/anti-diagonal TopoA-Scan branch with the standard Cross-Scan branch to provide complementary structural priors, and introduces ScanCache, a device-aware caching mechanism that amortizes explicit scan-index construction across recurring resolutions. To fuse heterogeneous scan features efficiently, we further propose a lightweight HSIC Gate that regulates branch interaction using a dependence-aware scalar gating rule. We also instantiate a volumetric TopoMamba-3D for practical 3D clinical segmentation. Experiments on Synapse CT, ISIC 2017 dermoscopy, and CVC-ClinicDB endoscopy show that TopoMamba consistently improves segmentation quality over strong CNN, Transformer, and SSM baselines, with particularly clear gains on thin or curved targets such as the pancreas and gallbladder, while maintaining favorable deployment efficiency under dynamic input resolutions. These results suggest that topology-aware scan ordering and lightweight dependence-aware fusion form an effective and practical design for medical multimedia segmentation. The code will be made publicly available.