SDDF: Specificity-Driven Dynamic Focusing for Open-Vocabulary Camouflaged Object Detection
arXiv cs.CV / 3/30/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces SDDF (Specificity-Driven Dynamic Focusing), a method for open-vocabulary camouflaged object detection that targets failures caused by high visual similarity between camouflaged objects and their backgrounds.
- It builds a new benchmark, OVCOD-D, by augmenting selected camouflaged object images with fine-grained text descriptions to support open-vocabulary evaluation.
- The approach uses multimodal LLM-generated sub-descriptions but filters out confusing or overly decorative textual modifiers via a sub-description principal component contrastive fusion strategy.
- It further improves discrimination using specificity-guided regional weak alignment and dynamic focusing to strengthen localization of camouflaged objects under an open-set setting.
- On OVCOD-D, the proposed method reports an AP of 56.4, indicating effectiveness on the newly defined benchmark.




