Hierarchical Visual Relocalization with Nearest View Synthesis from Feature Gaussian Splatting

arXiv cs.CV / 4/1/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper presents SplatHLoc, a hierarchical visual relocalization framework that estimates camera pose for previously known scenes using Feature Gaussian Splatting as the underlying scene representation.
  • It improves handling of sparse database observations via an adaptive viewpoint retrieval method that synthesizes virtual candidate viewpoints better aligned with the query to strengthen initial pose estimates.
  • For matching, the method uses a hybrid strategy that leverages Gaussian-rendered features for coarse matching and directly extracted image features for fine matching, aiming to combine complementary strengths.
  • Experiments on indoor and outdoor datasets show improved robustness and report state-of-the-art performance for visual relocalization.

Abstract

Visual relocalization is a fundamental task in the field of 3D computer vision, estimating a camera's pose when it revisits a previously known scene. While point-based hierarchical relocalization methods have shown strong scalability and efficiency, they are often limited by sparse image observations and weak feature matching. In this work, we propose SplatHLoc, a novel hierarchical visual relocalization framework that uses Feature Gaussian Splatting as the scene representation. To address the sparsity of database images, we propose an adaptive viewpoint retrieval method that synthesizes virtual candidates with viewpoints more closely aligned with the query, thereby improving the accuracy of initial pose estimation. For feature matching, we observe that Gaussian-rendered features and those extracted directly from images exhibit different strengths across the two-stage matching process: the former performs better in the coarse stage, while the latter proves more effective in the fine stage. Therefore, we introduce a hybrid feature matching strategy, enabling more accurate and efficient pose estimation. Extensive experiments on both indoor and outdoor datasets show that SplatHLoc enhances the robustness of visual relocalization, setting a new state-of-the-art.