AI Navigate

Prompt-Free Universal Region Proposal Network

arXiv cs.CV / 3/19/2026

📰 NewsModels & Research

Key Points

  • The paper introduces the Prompt-Free Universal Region Proposal Network (PF-RPN), a method that identifies potential objects without relying on external prompts.
  • It consists of three modules: Sparse Image-Aware Adapter (SIA) for initial localization, Cascade Self-Prompt (CSP) for discovering remaining objects, and Centerness-Guided Query Selection (CG-QS) to select high-quality query embeddings.
  • PF-RPN can be optimized with limited data (e.g., 5% of MS COCO) and applied directly to diverse domains such as underwater object detection, industrial defect detection, and remote sensing without fine-tuning.
  • Experimental results on 19 datasets demonstrate strong cross-domain effectiveness and generalization.
  • The authors provide code at the linked GitHub repository.

Abstract

Identifying potential objects is critical for object recognition and analysis across various computer vision applications. Existing methods typically localize potential objects by relying on exemplar images, predefined categories, or textual descriptions. However, their reliance on image and text prompts often limits flexibility, restricting adaptability in real-world scenarios. In this paper, we introduce a novel Prompt-Free Universal Region Proposal Network (PF-RPN), which identifies potential objects without relying on external prompts. First, the Sparse Image-Aware Adapter (SIA) module performs initial localization of potential objects using a learnable query embedding dynamically updated with visual features. Next, the Cascade Self-Prompt (CSP) module identifies the remaining potential objects by leveraging the self-prompted learnable embedding, autonomously aggregating informative visual features in a cascading manner. Finally, the Centerness-Guided Query Selection (CG-QS) module facilitates the selection of high-quality query embeddings using a centerness scoring network. Our method can be optimized with limited data (e.g., 5% of MS COCO data) and applied directly to various object detection application domains for identifying potential objects without fine-tuning, such as underwater object detection, industrial defect detection, and remote sensing image object detection. Experimental results across 19 datasets validate the effectiveness of our method. Code is available at https://github.com/tangqh03/PF-RPN.