NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection
arXiv cs.CV / 3/24/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper identifies a key train–test mismatch in open-vocabulary object detection, where unlabeled novel-category objects are repeatedly treated as background and filtered out early, reducing recall at inference.
- It introduces NoOVD, a training framework that uses self-distillation from frozen vision-language models (VLMs) to help the detector discover novel categories without needing extra data.
- A proposed component (K-FPN) transfers VLM knowledge to guide novel-category discovery and avoid forced alignment of novel objects with background.
- During inference, the method adds R-RPN to adjust proposal confidence scores to improve the recall of novel-category objects.
- Experiments across OV-LVIS, OV-COCO, and Objects365 show consistent performance improvements across multiple evaluation metrics.
Related Articles
Regulating Prompt Markets: Securities Law, Intellectual Property, and the Trading of Prompt Assets
Dev.to
Mercor competitor Deccan AI raises $25M, sources experts from India
Dev.to
How We Got Local MCP Servers Working in Claude Cowork (The Missing Guide)
Dev.to
How Should Students Document AI Usage in Academic Work?
Dev.to
They Did Not Accidentally Make Work the Answer to Who You Are
Dev.to