Genie Sim PanoRecon: Fast Immersive Scene Generation from Single-View Panorama

arXiv cs.RO / 4/9/2026

Key Points

  • Genie Sim PanoRecon is introduced as a feed-forward Gaussian-splatting pipeline that reconstructs high-fidelity 3D scenes from a single-view panorama for robotic manipulation simulation.
  • The method splits the panorama into six cube-map faces, processes them in parallel, and reassembles them while preserving geometric consistency across views.
  • It uses a depth-aware fusion strategy plus a training-free depth-injection module to guide a monocular feed-forward network toward coherent 3D Gaussian outputs.
  • The pipeline can generate photo-realistic 3D scenes in seconds and is integrated into Genie Sim, an LLM-driven platform for embodied synthetic data generation and task evaluation.
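The cube-map split described above can be illustrated with a minimal sketch. It assumes the panorama is stored in equirectangular form, uses an x-right, y-up, z-forward axis convention, and samples with nearest-neighbour lookup. The function name `panorama_to_cube_faces`, the `FACES` table, and the face orientations are hypothetical choices for illustration and are not taken from the Genie Sim code.

```python
import numpy as np

# Assumed convention: x right, y up, z forward. Each face is (forward, right, up).
FACES = {
    "front": ((0, 0, 1), (1, 0, 0), (0, 1, 0)),
    "right": ((1, 0, 0), (0, 0, -1), (0, 1, 0)),
    "back": ((0, 0, -1), (-1, 0, 0), (0, 1, 0)),
    "left": ((-1, 0, 0), (0, 0, 1), (0, 1, 0)),
    "up": ((0, 1, 0), (1, 0, 0), (0, 0, -1)),
    "down": ((0, -1, 0), (1, 0, 0), (0, 0, 1)),
}


def panorama_to_cube_faces(pano, face_size):
    """Sample six 90-degree cube-map faces from an equirectangular panorama."""
    height, width = pano.shape[:2]
    # Pixel-centre coordinates in [-1, 1]; image rows grow downward, so v is flipped.
    ticks = (np.arange(face_size) + 0.5) / face_size * 2.0 - 1.0
    u, v = np.meshgrid(ticks, -ticks)
    faces = {}
    for name, (fwd, right, up) in FACES.items():
        fwd, right, up = (np.asarray(a, dtype=np.float64) for a in (fwd, right, up))
        # One viewing ray per face pixel, normalised to unit length.
        rays = fwd + u[..., None] * right + v[..., None] * up
        rays /= np.linalg.norm(rays, axis=-1, keepdims=True)
        # Convert each ray to longitude/latitude, then to panorama pixel indices.
        lon = np.arctan2(rays[..., 0], rays[..., 2])
        lat = np.arcsin(np.clip(rays[..., 1], -1.0, 1.0))
        cols = np.floor((lon / (2 * np.pi) + 0.5) * width).astype(int) % width
        rows = np.clip(np.floor((0.5 - lat / np.pi) * height).astype(int), 0, height - 1)
        faces[name] = pano[rows, cols]
    return faces
```

Because each face depends only on the shared panorama, the six lookups are independent and could be dispatched in parallel. A production pipeline would likely use bilinear or better interpolation rather than nearest-neighbour sampling.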

Abstract

We present Genie Sim PanoRecon, a feed-forward Gaussian-splatting pipeline that delivers high-fidelity, low-cost 3D scenes for robotic manipulation simulation. The panorama input is decomposed into six non-overlapping cube-map faces, which are processed in parallel and seamlessly reassembled. To guarantee geometric consistency across views, we devise a depth-aware fusion strategy coupled with a training-free depth-injection module that steers the monocular feed-forward network to generate coherent 3D Gaussians. The whole system reconstructs photo-realistic scenes in seconds and has been integrated into Genie Sim, an LLM-driven simulation platform for embodied synthetic data generation and evaluation, to provide scalable backgrounds for manipulation tasks. For code details, please refer to: https://github.com/AgibotTech/genie_sim/tree/main/source/geniesim_world.
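To make the cross-face consistency requirement concrete, the sketch below lifts a per-face depth map into a shared panorama-centred frame, where points from neighbouring faces should agree along their common edge. This is not the paper's depth-aware fusion or depth-injection module, whose details the abstract does not give. The helper `unproject_face_depth`, the use of z-depth (distance along the face's forward axis), and the axis convention are all assumptions made for illustration.

```python
import numpy as np


def unproject_face_depth(depth, forward, right, up):
    """Lift an (N, N) per-face z-depth map to (N, N, 3) points in a shared frame."""
    n = depth.shape[0]
    # Pixel-centre coordinates in [-1, 1]; image rows grow downward, so v is flipped.
    ticks = (np.arange(n) + 0.5) / n * 2.0 - 1.0
    u, v = np.meshgrid(ticks, -ticks)
    forward, right, up = (np.asarray(a, dtype=np.float64) for a in (forward, right, up))
    # For a 90-degree face the image plane sits at unit distance along `forward`,
    # so scaling the unnormalised ray by z-depth gives the 3D point.
    rays = forward + u[..., None] * right + v[..., None] * up
    return depth[..., None] * rays


# Hypothetical check: a cube-shaped room has z-depth 1 on every face,
# so the front and right faces should nearly meet along their shared edge.
n = 64
depth = np.ones((n, n))
front = unproject_face_depth(depth, (0, 0, 1), (1, 0, 0), (0, 1, 0))
right = unproject_face_depth(depth, (1, 0, 0), (0, 0, -1), (0, 1, 0))
seam_gap = np.linalg.norm(front[:, -1] - right[:, 0], axis=-1).max()
```

Here `seam_gap` stays within roughly one pixel footprint because the depths agree. Independently predicted monocular depths generally would not agree at the seams, which is the kind of mismatch a fusion step has to resolve.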