AI Navigate

Evidential learning driven Breast Tumor Segmentation with Stage-divided Vision-Language Interaction

arXiv cs.CV / 3/13/2026

📰 NewsModels & Research

Key Points

  • The paper proposes TextBCS, a text-guided breast tumor segmentation model that uses stage-divided vision-language interaction to mutually exchange visual and text features at each down-sampling stage to improve locating lesions in low-contrast MRI.
  • It introduces evidential learning using variational Dirichlet to quantify segmentation uncertainty, addressing blurred boundaries.
  • The approach leverages text prompts to delineate segmentation regions, enhancing segmentation accuracy in challenging contrast scenarios.
  • Experimental results show TextBCS achieving superior performance compared with other segmentation networks on publicly available datasets.

Abstract

Breast cancer is one of the most common causes of death among women worldwide, with millions of fatalities annually. Magnetic Resonance Imaging (MRI) can provide various sequences for characterizing tumor morphology and internal patterns, and becomes an effective tool for detection and diagnosis of breast tumors. However, previous deep-learning based tumor segmentation methods have limitations in accurately locating tumor contours due to the challenge of low contrast between cancer and normal areas and blurred boundaries. Leveraging text prompt information holds promise in ameliorating tumor segmentation effect by delineating segmentation regions. Inspired by this, we propose text-guided Breast Tumor Segmentation model (TextBCS) with stage-divided vision-language interaction and evidential learning. Specifically, the proposed stage-divided vision-language interaction facilitates information mutual between visual and text features at each stage of down-sampling, further exerting the advantages of text prompts to assist in locating lesion areas in low contrast scenarios. Moreover, the evidential learning is adopted to quantify the segmentation uncertainty of the model for blurred boundary. It utilizes the variational Dirichlet to characterize the distribution of the segmentation probabilities, addressing the segmentation uncertainties of the boundaries. Extensive experiments validate the superiority of our TextBCS over other segmentation networks, showcasing the best breast tumor segmentation performance on publicly available datasets.