RareSpot+: A Benchmark, Model, and Active Learning Framework for Small and Rare Wildlife in Aerial Imagery

arXiv cs.CV / 4/23/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research

Key Points

  • RareSpot+ is a new benchmark, model, and active learning framework aimed at improving automated wildlife monitoring in aerial imagery, focusing on the hard cases of small and rare species.
  • The approach boosts small-object localization without architectural changes by using a novel multi-scale consistency loss that aligns intermediate feature maps across detection heads.
  • It increases robustness and ecological plausibility through context-aware augmentation, generating hard but realistic training examples to better handle visually subtle targets.
  • A geospatially guided active learning module uses spatial priors (e.g., prairie dogs and burrows), test-time augmentation, and a meta-uncertainty model to reduce redundant expert labeling while improving detection accuracy.
  • Experiments on a 2 km² aerial dataset show a +35.2% improvement in mAP@50 over the baseline, and cross-dataset transfer results plus active learning gains (14.5% AP with only 1.7% labeling budget) demonstrate practical effectiveness.

Abstract

Automated wildlife monitoring from aerial imagery is vital for conservation but remains limited by two persistent challenges: the difficulty of detecting small, rare species and the high cost of large-scale expert annotation. Prairie dogs exemplify this problem -- they are ecologically important yet appear tiny, sparsely distributed, and visually indistinct from their surroundings, posing a severe challenge for conventional detection models. To overcome these limitations, we present RareSpot+, a detection framework that integrates multi-scale consistency learning, context-aware augmentation, and geospatially guided active learning to address these issues. A novel multi-scale consistency loss aligns intermediate feature maps across detection heads, enhancing localization of small (approx. 30 pixels wide) objects without architectural changes, while context-aware augmentation improves robustness by synthesizing hard, ecologically plausible examples. A geospatial active learning module exploits domain-specific spatial priors linking prairie dogs and burrows, together with test-time augmentation and a meta-uncertainty model, to reduce redundant labeling. On a 2 km^2 aerial dataset, RareSpot+ improves detection over the baseline mAP@50 by +35.2% (absolute +0.13). Cross-dataset tests on HerdNet, AED, and several other wildlife benchmarks demonstrate robust detector-level transferability. The active learning module further boosts prairie dog AP by 14.5% using an annotation budget of just 1.7% of the unlabeled tiles. Beyond detection, RareSpot+ enables spatial ecological analyses such as clustering and co-occurrence, linking vision-based detection with quantitative ecology.