UAV-DETR: DETR for Anti-Drone Target Detection

arXiv cs.CV / 3/25/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • UAV-DETR is a new DETR-based deep learning framework aimed at improving anti-drone/anti-UAV detection, especially for miniature drones in complex backgrounds and harsh environmental conditions.
  • The method combines a WTConv-enhanced backbone and a Sliding Window Self-Attention (SWSA-IFI) encoder to preserve high-frequency structural details of tiny targets while cutting parameter overhead for real-time performance.
  • It adds an Efficient Cross-Scale Feature Recalibration and Fusion Network (ECFRFN) to suppress background noise and better fuse multi-scale semantics for more accurate detection.
  • UAV-DETR further improves training robustness for small objects by using a hybrid Inner-CIoU and NWD loss to reduce sensitivity to minor positional errors compared with standard IoU.
  • Experiments report clear gains over RT-DETR on both a custom UAV dataset and the DUT-ANTI-UAV benchmark, with additional efficiency benefits (higher accuracy alongside a substantial parameter reduction) and published code on GitHub.

Abstract

Drone detection is pivotal in numerous security and counter-UAV applications. However, existing deep learning-based methods typically struggle to balance robust feature representation with computational efficiency. This challenge is particularly acute when detecting miniature drones against complex backgrounds under severe environmental interference. To address these issues, we introduce UAV-DETR, a novel framework that integrates a small-target-friendly architecture with real-time detection capabilities. Specifically, UAV-DETR features a WTConv-enhanced backbone and a Sliding Window Self-Attention (SWSA-IFI) encoder, capturing the high-frequency structural details of tiny targets while drastically reducing parameter overhead. Furthermore, we propose an Efficient Cross-Scale Feature Recalibration and Fusion Network (ECFRFN) to suppress background noise and aggregate multi-scale semantics. To further enhance accuracy, UAV-DETR incorporates a hybrid Inner-CIoU and NWD loss strategy, mitigating the extreme sensitivity of standard IoU metrics to minor positional deviations in small objects. Extensive experiments demonstrate that UAV-DETR significantly outperforms the baseline RT-DETR on our custom UAV dataset (+6.61% in mAP50:95, with a 39.8% reduction in parameters) and the public DUT-ANTI-UAV benchmark (+1.4% in Precision, +1.0% in F1-Score). These results establish UAV-DETR as a superior trade-off between efficiency and precision in counter-UAV object detection. The code is available at https://github.com/wd-sir/UAVDETR.