Learning from Noisy Prompts: Saliency-Guided Prompt Distillation for Robust Segmentation with SAM

arXiv cs.CV / 4/28/2026

📰 NewsIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper addresses a key limitation of SAM in medical imaging: its zero-shot segmentation can fail when prompts are weak, generic, or noisy, as is common in real clinical workflows.
It proposes SPD (Saliency-Guided Prompt Distillation), which learns anatomical priors via a lightweight saliency head to produce more reliable localization guidance.
SPD uses Contextual Prompt Distillation to validate and enrich noisy prompts by leveraging cues from anatomically adjacent slices, aiming to form a consensus prompt set aligned with expert-like reasoning.
A Pairwise Slice Consistency objective enforces local anatomical coherence, improving both mask regions and boundaries.
Experiments on four difficult MRI/CT benchmarks show SPD consistently outperforming existing SAM adaptations and supervised baselines across multiple segmentation metrics.

Abstract

Segmentation is central to clinical diagnosis and monitoring, yet the reliability of modern foundation models in medical imaging still depends on the availability of precise prompts. The Segment Anything Model (SAM) offers powerful zero-shot capabilities, although it collapses under the weak, generic, and noisy prompts that dominate real clinical workflows. In practice, annotations such as centerline points are coarse and ambiguous, often drifting across neighboring anatomy and misguiding SAM toward inconsistent or incomplete masks. We introduce SPD, a Saliency-Guided Prompt Distillation framework that converts these unreliable cues into robust guidance. SPD first learns data-driven anatomical priors through a lightweight saliency head to obtain confident localization maps. These priors then drive Contextual Prompt Distillation, which validates and enriches noisy prompts using cues from anatomically adjacent slices, producing a consensus prompt set that matches the behavior of expert reasoning. A Pairwise Slice Consistency objective further enforces local anatomical coherence during segmentation. Experiments on four challenging MRI and CT benchmarks demonstrate that SPD consistently outperforms existing SAM adaptations and supervised baselines, delivering large gains in both region-based and boundary-based metrics. SPD provides a practical and principled path toward reliable foundation model deployment in clinical environments where only imperfect prompts are available.