Multicentric thrombus segmentation using an attention-based recurrent network with gradual modality dropout

arXiv cs.CV / 4/2/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses 3D medical image segmentation of small, low-contrast targets such as ischemic-stroke thrombi, which are difficult due to variability across imaging modalities and real-world multi-center domain shifts.
  • It proposes UpAttLLSTM, an attention-based recurrent (2.5D) segmentation network that aggregates slice context and uses attention gates to fuse information across available sequences.
  • A training strategy with progressive difficulty and gradual modality dropout is used to simulate missing modalities and site-specific heterogeneity, improving robustness and generalization to multi-center data.
  • Results report >90% detection with Dice 0.65 on a monocentric cohort, and about 80% detection with Dice ~0.35 in multi-center settings with missing modalities.
  • The authors claim the approach can transfer to other small-lesion segmentation tasks in 3D medical imaging where targets are scarce and modality-dependent.

Abstract

Detecting and delineating tiny targets in 3D brain scans is a central yet under-addressed challenge in medical imaging.In ischemic stroke, for instance, the culprit thrombus is small, low-contrast, and variably expressed across modalities(e.g., susceptibility-weighted T2 blooming, diffusion restriction on DWI/ADC), while real-world multi-center dataintroduce domain shifts, anisotropy, and frequent missing sequences. We introduce a methodology that couples an attention-based recurrent segmentation network (UpAttLLSTM), a training schedule that progressively increases the difficulty of hetero-modal learning, with gradual modality dropout, UpAttLLSTM aggregates context across slices via recurrent units (2.5D) and uses attention gates to fuse complementary cues across available sequences, making it robust to anisotropy and class imbalance. Gradual modality dropout systematically simulates site heterogeneity,noise, and missing modalities during training, acting as both augmentation and regularization to improve multi-center generalization. On a monocentric cohort, our approach detects thrombi in >90% of cases with a Dice score of 0.65. In a multi-center setting with missing modalities, it achieves-80% detection with a Dice score around 0.35. Beyond stroke, the proposed methodology directly transfers to other small-lesion tasks in 3D medical imaging where targets are scarce, subtle, and modality-dependent