PlanaReLoc: Camera Relocalization in 3D Planar Primitives via Region-Based Structure Matching

arXiv cs.CV / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • PlanaReLoc presents a lightweight 6-DoF camera relocalization approach that leverages 3D planar primitives and planar maps to establish reliable correspondences between a query image and a map.
  • The method uses a deep matcher to associate planar primitives across modalities in a learned unified embedding space, followed by robust pose solving and refinement.
  • Experiments on ScanNet and 12Scenes show that planar primitives enable strong cross-modal structural matching and accurate relocalization.
  • The paper claims practical advantages including not requiring realistically textured/colored maps, pose priors, or per-scene training, which could reduce deployment costs in structured environments.
  • Code and data are released publicly via GitHub, enabling reproducibility and further experimentation.

Abstract

While structure-based relocalizers have long strived for point correspondences when establishing or regressing query-map associations, in this paper, we pioneer the use of planar primitives and 3D planar maps for lightweight 6-DoF camera relocalization in structured environments. Planar primitives, beyond being fundamental entities in projective geometry, also serve as region-based representations that encapsulate both structural and semantic richness. This motivates us to introduce PlanaReLoc, a streamlined plane-centric paradigm where a deep matcher associates planar primitives across the query image and the map within a learned unified embedding space, after which the 6-DoF pose is solved and refined under a robust framework. Through comprehensive experiments on the ScanNet and 12Scenes datasets across hundreds of scenes, our method demonstrates the superiority of planar primitives in facilitating reliable cross-modal structural correspondences and achieving effective camera relocalization without requiring realistically textured/colored maps, pose priors, or per-scene training. The code and data are available at https://github.com/3dv-casia/PlanaReLoc .