Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting
arXiv cs.CV / 3/18/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Exclusion-Constrained Dual-Prompt SAM (EDP-SAM) with a Nearest Neighbor Exclusion Circle (NNEC) constraint to generate mask supervision for crowded-scene datasets.
- It then proposes Exclusivity-Guided Mask Learning (XMask), a discriminative objective that enforces spatial separation between instances to improve segmentation in dense crowds.
- The approach employs Gaussian smoothing and a differentiable center sampling strategy to enhance feature continuity and training stability during semi-supervised learning.
- A semi-supervised crowd counting framework built on XMask uses instance mask priors as pseudo-labels, achieving state-of-the-art performance on ShanghaiTech A, UCF-QNRF, and JHU++ with 5–40% labeled data and bridging counting with instance segmentation in a unified model.




