MonoEM-GS: Monocular Expectation-Maximization Gaussian Splatting SLAM

arXiv cs.RO / 4/14/2026


Key Points

MonoEM-GS is a monocular SLAM mapping pipeline that uses feed-forward geometric priors from RGB to build a global Gaussian Splatting representation.
The method addresses view-dependent/noisy geometry and local metric drift by coupling Gaussian Splatting with an Expectation–Maximization formulation to stabilize reconstruction.
For pose estimation, MonoEM-GS uses ICP-based alignment to estimate monocular camera motion.
It parameterizes Gaussians with multi-modal features, enabling in-place open-set segmentation and other downstream queries directly on the reconstructed map.
The approach is evaluated on 7-Scenes, TUM RGB-D, and Replica, with comparisons to recent baselines.
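To make the Expectation–Maximization idea in the key points concrete, here is a minimal, generic sketch of one EM iteration for a mixture of isotropic 3D Gaussians fitted to a point set. It is a textbook illustration only: the summary does not give the paper's actual formulation, so the isotropic covariances, the fixed number of Gaussians, and the names `em_step` and `fit` are all assumptions made for this example.

```python
import math

def em_step(points, means, variances, weights):
    """Run one EM iteration for a mixture of isotropic 3D Gaussians.

    Hypothetical helper for illustration; not the paper's formulation.
    """
    n = len(points)
    # E-step: soft-assign every point to every Gaussian (responsibilities).
    resp = []
    for p in points:
        row = []
        for mu, var, w in zip(means, variances, weights):
            d2 = sum((a - b) ** 2 for a, b in zip(p, mu))
            norm = (2.0 * math.pi * var) ** 1.5  # isotropic 3D normalizer
            row.append(w * math.exp(-0.5 * d2 / var) / norm)
        total = sum(row)
        if total == 0.0:
            # Every density underflowed: fall back to a uniform assignment.
            row, total = [1.0] * len(row), float(len(row))
        resp.append([r / total for r in row])

    # M-step: re-estimate each Gaussian from its weighted points.
    new_means, new_vars, new_weights = [], [], []
    for j, (mu_old, var_old) in enumerate(zip(means, variances)):
        nj = sum(r[j] for r in resp)
        if nj < 1e-12:
            # No support for this Gaussian: keep it unchanged, zero weight.
            new_means.append(mu_old)
            new_vars.append(var_old)
            new_weights.append(0.0)
            continue
        mu = tuple(
            sum(r[j] * p[d] for r, p in zip(resp, points)) / nj
            for d in range(3)
        )
        spread = sum(
            r[j] * sum((a - b) ** 2 for a, b in zip(p, mu))
            for r, p in zip(resp, points)
        )
        new_means.append(mu)
        new_vars.append(max(spread / (3.0 * nj), 1e-6))  # variance floor
        new_weights.append(nj / n)
    return new_means, new_vars, new_weights

def fit(points, means, variances, weights, iters=10):
    """Repeat em_step a fixed number of times."""
    for _ in range(iters):
        means, variances, weights = em_step(points, means, variances, weights)
    return means, variances, weights
```

The soft assignments in the E-step are what let noisy or inconsistent points contribute to several Gaussians with reduced weight instead of being hard-assigned to one. A real Gaussian Splatting map would use anisotropic covariances and appearance terms, which this sketch leaves out.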

Abstract

Feed-forward geometric foundation models can infer dense point clouds and camera motion directly from RGB streams, providing priors for monocular SLAM. However, their predictions are often view-dependent and noisy: geometry can vary across viewpoints and under image transformations, and local metric properties may drift between frames. We present MonoEM-GS, a monocular mapping pipeline that integrates such geometric predictions into a global Gaussian Splatting representation while explicitly addressing these inconsistencies. MonoEM-GS couples Gaussian Splatting with an Expectation–Maximization formulation to stabilize geometry, and employs ICP-based alignment for monocular pose estimation. Beyond geometry, MonoEM-GS parameterizes Gaussians with multi-modal features, enabling in-place open-set segmentation and other downstream queries directly on the reconstructed map. We evaluate MonoEM-GS on 7-Scenes, TUM RGB-D and Replica, and compare against recent baselines.
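As a rough illustration of the ICP-based alignment mentioned in the abstract, the sketch below runs point-to-point ICP on a 2D toy problem: match each source point to its nearest target point, solve for the best rigid transform in closed form, apply it, and repeat. The abstract does not say which ICP variant the authors use, so the 2D setting, the brute-force nearest-neighbour search, and the names `best_rigid_2d`, `transform`, and `icp_2d` are assumptions chosen to keep the example short and dependency-free.

```python
import math

def best_rigid_2d(src, dst):
    """Least-squares rotation angle and translation mapping src[i] to dst[i]."""
    n = len(src)
    sx = sum(p[0] for p in src) / n
    sy = sum(p[1] for p in src) / n
    dx = sum(q[0] for q in dst) / n
    dy = sum(q[1] for q in dst) / n
    dot = cross = 0.0
    for (px, py), (qx, qy) in zip(src, dst):
        px, py, qx, qy = px - sx, py - sy, qx - dx, qy - dy
        dot += px * qx + py * qy      # cosine-weighted term
        cross += px * qy - py * qx    # sine-weighted term
    theta = math.atan2(cross, dot)
    c, s = math.cos(theta), math.sin(theta)
    # Translation moves the rotated source centroid onto the target centroid.
    tx = dx - (c * sx - s * sy)
    ty = dy - (s * sx + c * sy)
    return theta, tx, ty

def transform(points, theta, tx, ty):
    """Rotate points by theta about the origin, then translate by (tx, ty)."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + tx, s * x + c * y + ty) for x, y in points]

def icp_2d(src, dst, iters=20):
    """Point-to-point ICP; returns the net (theta, tx, ty) from src to dst."""
    cur = list(src)
    for _ in range(iters):
        # Correspondence step: brute-force nearest neighbour in dst.
        matches = [
            min(dst, key=lambda q: (q[0] - p[0]) ** 2 + (q[1] - p[1]) ** 2)
            for p in cur
        ]
        # Alignment step: closed-form rigid fit to the current matches.
        theta, tx, ty = best_rigid_2d(cur, matches)
        cur = transform(cur, theta, tx, ty)
    # cur is a rigid motion of src, so this recovers the accumulated transform.
    return best_rigid_2d(src, cur)
```

Like any ICP, this sketch only converges when the initial misalignment is small enough for nearest-neighbour matches to be mostly correct. A 3D version would replace the closed-form angle with an SVD-based rotation estimate and the brute-force search with a spatial index.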


