GaussianSSC: Triplane-Guided Directional Gaussian Fields for 3D Semantic Completion

arXiv cs.RO / 2026/3/24

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • GaussianSSC is a two-stage, grid-native semantic scene completion method that uses triplane guidance while avoiding separate Gaussian sets or replacing the voxel grid entirely.
  • The approach introduces Gaussian Anchoring, performing sub-pixel, Gaussian-weighted aggregation over fused FPN features to tighten voxel–image alignment and improve monocular occupancy estimation.
  • It learns a per-voxel Gaussian field from point-like voxel features and refines triplane features using a Gaussian–Triplane Refinement module that mixes local (target-centric) gathering with global (source-centric) aggregation.
  • GaussianSSC employs directional, anisotropic Gaussian support to better capture surface tangency, scale, and occlusion-aware asymmetry while maintaining triplane efficiency.
  • On SemanticKITTI, the method reports improvements over state-of-the-art baselines, including +1.0% Recall, +2.0% Precision, and +1.8% IoU for Stage 1 occupancy, and +1.8% IoU plus +0.8% mIoU for Stage 2 semantic prediction.

Abstract

We present \emph{GaussianSSC}, a two-stage, grid-native and triplane-guided approach to semantic scene completion (SSC) that injects the benefits of Gaussians without replacing the voxel grid or maintaining a separate Gaussian set. We introduce \emph{Gaussian Anchoring}, a sub-pixel, Gaussian-weighted image aggregation over fused FPN features that tightens voxel--image alignment and improves monocular occupancy estimation. We further convert point-like voxel features into a learned per-voxel Gaussian field and refine triplane features via a triplane-aligned \emph{Gaussian--Triplane Refinement} module that combines \emph{local gathering} (target-centric) and \emph{global aggregation} (source-centric). This directional, anisotropic support captures surface tangency, scale, and occlusion-aware asymmetry while preserving the efficiency of triplane representations. On SemanticKITTI~\cite{behley2019semantickitti}, GaussianSSC improves Stage~1 occupancy by +1.0\% Recall, +2.0\% Precision, and +1.8\% IoU over state-of-the-art baselines, and improves Stage~2 semantic prediction by +1.8\% IoU and +0.8\% mIoU.