Exploring Hierarchical Consistency and Unbiased Objectness for Open-Vocabulary Object Detection
arXiv cs.CV / 4/28/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper targets the limitations of open-vocabulary object detection (OVD), which typically relies on vision-language models to create pseudo labels for novel classes but can misassign labels and produce unreliable objectness scores.
- It introduces a hierarchical confidence calibration (HCC) method that improves class label estimation by enforcing consistency across hierarchical semantic levels (class, super-category, and sub-category).
- It proposes LoCLIP, a parameter-efficient adaptation of CLIP that adds an objectness token to reduce bias toward base classes in region proposal networks (RPNs) and better estimate objectness for novel categories.
- Experiments on major OVD benchmarks such as COCO and LVIS show the approach achieves new state-of-the-art performance, indicating strong effectiveness across standard evaluation settings.
Related Articles

Write a 1,200-word blog post: "What is Generative Engine Optimization (GEO) and why SEO teams need it now"
Dev.to

Indian Developers: How to Build AI Side Income with $0 Capital in 2026
Dev.to

Most People Use AI Like Google. That's Why It Sucks.
Dev.to

Behind the Scenes of a Self-Evolving AI: The Architecture of Tian AI
Dev.to

Tian AI vs ChatGPT: Why Local AI Is the Future of Privacy
Dev.to