RefAerial: A Benchmark and Approach for Referring Detection in Aerial Images
arXiv cs.CV / 4/23/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- The paper introduces RefAerial, a large-scale benchmark dataset for referring detection in aerial images, designed to overcome limitations of prior ground-image datasets.
- RefAerial is characterized by low but diverse object-to-scene ratios, many targets and distractors, complex fine-grained referring descriptions, and broad diverse aerial scenes.
- The authors develop REA-Engine, a human-in-the-loop semi-automated annotation system to efficiently generate referring pairs for the dataset.
- They find that existing ground referring detection models degrade significantly on aerial data due to scale variety issues, and propose a scale-comprehensive and sensitive (SCS) framework using mixture-of-granularity attention plus a comprehensive-to-sensitive two-stage decoding strategy.
- The proposed SCS framework delivers strong results on RefAerial and also shows performance gains on traditional ground referring detection datasets.
Related Articles

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Trajectory Forecasts in Unknown Environments Conditioned on Grid-Based Plans
Dev.to

10 AI Tools Every Developer Should Try in 2026
Dev.to

OpenAI Just Named It Workspace Agents. We Open-Sourced Our Lark Version Six Months Ago
Dev.to

GPT Image 2 Subject-Lock Editing: A Practical Guide to input_fidelity
Dev.to