LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment

arXiv cs.CV / 3/23/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • LoD-Loc v3 introduces generalized aerial localization in dense urban environments by shifting from semantic silhouette alignment to instance silhouette alignment.
  • A new synthetic data pipeline creates InsLoD-Loc, the largest instance segmentation dataset for aerial imagery with 100k images and precise building annotations, enabling zero-shot generalization.
  • The method addresses cross-scene generalization and dense-building scene failures, reducing pose estimation ambiguity in dense urban areas.
  • Extensive experiments show LoD-Loc v3 outperforming state-of-the-art baselines by a large margin in both cross-scene and dense urban scenarios.
  • The project is available online at the provided URL (https://nudt-sawlab.github.io/LoD-Locv3/).

Abstract

We present LoD-Loc v3, a novel method for generalized aerial visual localization in dense urban environments. While prior work LoD-Loc v2 achieves localization through semantic building silhouette alignment with low-detail city models, it suffers from two key limitations: poor cross-scene generalization and frequent failure in dense building scenes. Our method addresses these challenges through two key innovations. First, we develop a new synthetic data generation pipeline that produces InsLoD-Loc - the largest instance segmentation dataset for aerial imagery to date, comprising 100k images with precise instance building annotations. This enables trained models to exhibit remarkable zero-shot generalization capability. Second, we reformulate the localization paradigm by shifting from semantic to instance silhouette alignment, which significantly reduces pose estimation ambiguity in dense scenes. Extensive experiments demonstrate that LoD-Loc v3 outperforms existing state-of-the-art (SOTA) baselines, achieving superior performance in both cross-scene and dense urban scenarios with a large margin. The project is available at https://nudt-sawlab.github.io/LoD-Locv3/.