Align then Refine: Text-Guided 3D Prostate Lesion Segmentation
arXiv cs.CV / 4/22/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper targets accurate 3D segmentation of prostate lesions from biparametric MRI, where cross-modal fusion and lesion-level semantics are still difficult for existing models.
- It introduces a new multi-encoder U-Net design that adds (1) an alignment loss to strengthen foreground text-image similarity for lesion semantics, and (2) a heatmap loss to better calibrate similarity maps while suppressing background activations.
- A final-stage confidence-gated multi-head cross-attention “refiner” performs localized boundary edits only in high-confidence regions to improve segmentation precision.
- A phase-scheduled training strategy is used to stabilize optimization across the multiple components, and the method reports consistent gains, achieving new state-of-the-art results on the PI-CAI dataset.
- The authors provide open-source code for the proposed approach to support reproduction and further research.
Related Articles

GeoReg LLM-Driven Few-Shot Socio-Economic Estimation for Data-Scarce Regions
Dev.to

Enterprise AI Governance Has Shifted from Policy to Execution
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Rethinking CNN Models for Audio Classification
Dev.to

Database world trying to build natural language query systems again – this time with LLMs
The Register