MedVeriSeg: Teaching MLLM-Based Medical Segmentation Models to Verify Query Validity Without Extra Training
arXiv cs.CV / 4/14/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces MedVeriSeg, a training-free framework for LISA-like MLLM-based medical segmentation models to reject “false queries” that reference non-existent targets, addressing hallucinated mask generation.
- It leverages an observed difference in the distribution patterns of similarity maps between the [SEG] token feature and MLLM image features for true versus false queries.
- MedVeriSeg adds a Similarity Response Quality Scoring Module that scores the similarity map using three criteria—strength, compactness, and purity—to produce an initial existence prediction.
- For final verification, it incorporates qualitative visual evidence by using GPT-4o to jointly evaluate the similarity heatmap and the module’s scoring outputs.
- Experiments on a small benchmark derived from SA-Med2D-20M indicate MedVeriSeg can effectively reject false-query requests while keeping recognition of true queries reliable.



