DualGeo: A Dual-View Framework for Worldwide Image Geo-localization

arXiv cs.CV / 4/29/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • DualGeo is a new two-stage framework for worldwide image geo-localization, targeting improved accuracy across street to continental scales.
  • It fuses image and semantic segmentation features using bidirectional cross-attention, then uses dual-view contrastive learning to align representations with GPS coordinates and build a global retrieval database.
  • For geo-cognitive refinement, DualGeo re-ranks candidate locations via geographic clustering before passing them to large multimodal models for final coordinate prediction.
  • Experiments on IM2GPS, IM2GPS3k, and YFCC4k show DualGeo surpasses prior state-of-the-art performance, with notable gains at both street (<1 km) and city (<25 km) levels.
  • The authors provide code and datasets via the linked GitHub repository, enabling reproducibility and further research.

Abstract

Worldwide image geo-localization aims to infer the geographic location of an image captured anywhere on Earth, spanning street, city, regional, national, and continental scales. Existing methods rely on visual features that are sensitive to environmental variations (e.g., lighting, season, and weather) and lack effective post-processing to filter outlier candidates, limiting localization accuracy. To address these limitations, we propose DualGeo, a two-stage framework for worldwide image geo-localization. First, it establishes a geo-representational foundation by fusing image and semantic segmentation features via bidirectional cross-attention. The fused features are then aligned with GPS coordinates through dual-view contrastive learning to build a global retrieval database. Second, it performs geo-cognitive refinement by re-ranking retrieved candidates using geographic clustering. It then feeds them into large multimodal models (LMMs) for final coordinate prediction. Experiments on IM2GPS, IM2GPS3k, and YFCC4k show that DualGeo outperforms state-of-the-art methods, improving street-level (<1 km) and city-level (<25 km) localization accuracy by 3.6%-16.58% and 1.29%-8.77%, respectively. Our code and datasets are available : https://github.com/CJ310177/DualGeo.