RMGS-SLAM: Real-time Multi-sensor Gaussian Splatting SLAM

arXiv cs.RO / 4/15/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces RMGS-SLAM, a tightly coupled LiDAR–Inertial–Visual (LIV) SLAM framework that uses real-time 3D Gaussian splatting to jointly provide low-latency pose estimation and continuous dense mapping in large-scale outdoor scenes.
  • It parallelizes state estimation, 3D Gaussian primitive initialization, and global Gaussian optimization to keep reconstruction synchronized with incoming sensor streams.
  • To improve initialization quality and speed up convergence, the authors propose a cascaded strategy combining feed-forward predictions with voxel-based PCA (voxel-PCA) geometric priors.
  • For long-term global consistency, RMGS-SLAM performs loop closure directly on the optimized global Gaussian map using Gaussian-based generalized ICP (GICP) to estimate loop constraints, followed by pose-graph optimization.
  • The authors also release hardware-synchronized LiDAR–camera–IMU datasets with ground-truth trajectories and report extensive experiments showing a balanced tradeoff among real-time efficiency, localization accuracy, and rendering quality.

Abstract

Real-time 3D Gaussian splatting (3DGS)-based Simultaneous Localization and Mapping (SLAM) in large-scale real-world environments remains challenging, as existing methods often struggle to jointly achieve low-latency pose estimation, 3D Gaussian reconstruction in step with incoming sensor streams, and long-term global consistency. In this paper, we present a tightly coupled LiDAR-Inertial-Visual (LIV) 3DGS-based SLAM framework for real-time pose estimation and photorealistic mapping in large-scale real-world scenes. The system executes state estimation and 3D Gaussian primitive initialization in parallel with global Gaussian optimization, thereby enabling continuous dense mapping. To improve Gaussian initialization quality and accelerate optimization convergence, we introduce a cascaded strategy that combines feed-forward predictions with voxel-based principal component analysis (voxel-PCA) geometric priors. To enhance global consistency in large scenes, we further perform loop closure directly on the optimized global Gaussian map by estimating loop constraints through Gaussian-based Generalized Iterative Closest Point (GICP) registration, followed by pose-graph optimization. In addition, we collected challenging large-scale looped outdoor SLAM sequences with hardware-synchronized LiDAR-camera-IMU and ground-truth trajectories to support realistic and comprehensive evaluation. Extensive experiments on both public datasets and our dataset demonstrate that the proposed method achieves a strong balance among real-time efficiency, localization accuracy, and rendering quality across diverse and challenging real-world scenes.