Lights Out: A Nighttime UAV Localization Framework Using Thermal Imagery and Semantic 3D Maps

arXiv cs.RO / 4/30/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research

Key Points

  • The paper addresses the open problem of reliable UAV backup localization in GNSS-denied nighttime settings, where day RGB maps and night thermal imagery differ dramatically.
  • It proposes a semantic reprojection framework that aligns segmented thermal observations to a globally referenced, semantically labeled 3D map built from daytime RGB data, avoiding purely appearance-based matching.
  • Localization is formulated using a shared semantic domain and optimized via a symmetric bidirectional reprojection objective with confusion-aware weighting to handle segmentation uncertainty.
  • Offline evaluation on 6.5 km of real nighttime UAV trajectories shows strong accuracy versus RTK GNSS ground truth, achieving a bias-corrected RMSE2D of 2.18 m and a median RMSE2D of 1.52 m.
  • The study finds performance depends heavily on semantic edge evidence, and large localization errors cluster in spatially semantically ambiguous areas rather than spreading uniformly, suggesting practical targets for robustness improvements.

Abstract

Reliable backup localization for unmanned aerial vehicles (UAVs) operating in GNSS-denied nighttime conditions remains an open challenge due to the severe modality gap between daytime RGB maps and nighttime thermal imagery. This work presents a semantic reprojection framework for map-relative nighttime UAV localization by aligning segmented thermal observations with a globally referenced, semantically labeled 3D map constructed from daytime RGB data. Rather than relying on appearance-based correspondence, localization is formulated in a shared semantic domain and solved via a symmetric bidirectional reprojection objective with confusion-aware weighting to improve robustness under segmentation uncertainty. The approach is evaluated offline across 6.5 km of nighttime, real-world UAV flight trajectories in urban and semi-structured environments. Relative to RTK GNSS ground truth, the system achieves a bias-corrected RMSE2D of 2.18 m and a median RMSE2D of 1.52 m. Results show that localization performance is strongly correlated with the availability of semantic edge evidence and that large-error events are spatially localized to semantically ambiguous areas rather than uniformly distributed. These findings indicate that semantic reprojection offers a promising pathway toward globally referenced nighttime UAV localization using thermal imagery alone.