SDD-YOLO: A Small-Target Detection Framework for Ground-to-Air Anti-UAV Surveillance with Edge-Efficient Deployment

arXiv cs.CV / 3/27/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • 論文は、地上から航空目線(G2A)で小型UAV(サブピクセル級)を検出するための専用フレームワーク「SDD-YOLO」を提案しています。
  • 4倍のダウンサンプリング解像度で動作するP2高解像度検出ヘッドにより、微小目標に必要な細かな空間情報をより確実に捉える設計です。
  • YOLO26のD F Lなし・NMSなしの推論向けアーキテクチャと、MuSGDのハイブリッド学習(ProgLoss、STAL)を組み合わせ、疎な小目標信号で起きやすい勾配の振動を抑制します。
  • 評価用に約3万枚の注釈画像からなるG2A向け大規模データセット「DroneSOD-30K」を構築し、SDD-YOLO-nはmAP@0.5で86.0%を達成してYOLOv5n比で+7.8pt改善しています。
  • 推論効率も高く、RTX 5090で226 FPS、Xeon CPUで35 FPSを示し、エッジでのリアルタイム運用を見据えた性能を示唆しています。

Abstract

Detecting small unmanned aerial vehicles (UAVs) from a ground-to-air (G2A) perspective presents significant challenges, including extremely low pixel occupancy, cluttered aerial backgrounds, and strict real-time constraints. Existing YOLO-based detectors are primarily optimized for general object detection and often lack adequate feature resolution for sub-pixel targets, while introducing complexities during deployment. In this paper, we propose SDD-YOLO, a small-target detection framework tailored for G2A anti-UAV surveillance. To capture fine-grained spatial details critical for micro-targets, SDD-YOLO introduces a P2 high-resolution detection head operating at 4 times downsampling. Furthermore, we integrate the recent architectural advancements from YOLO26, including a DFL-free, NMS-free architecture for streamlined inference, and the MuSGD hybrid training strategy with ProgLoss and STAL, which substantially mitigates gradient oscillation on sparse small-target signals. To support our evaluation, we construct DroneSOD-30K, a large-scale G2A dataset comprising approximately 30,000 annotated images covering diverse meteorological conditions. Experiments demonstrate that SDD-YOLO-n achieves a mAP@0.5 of 86.0% on DroneSOD-30K, surpassing the YOLOv5n baseline by 7.8 percentage points. Extensive inference analysis shows our model attains 226 FPS on an NVIDIA RTX 5090 and 35 FPS on an Intel Xeon CPU, demonstrating exceptional efficiency for future edge deployment.