DualSplat: Robust 3D Gaussian Splatting via Pseudo-Mask Bootstrapping from Reconstruction Failures

arXiv cs.CV / 4/24/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper highlights that 3D Gaussian Splatting (3DGS) can significantly degrade when training images include transient objects that break multi-view consistency.
  • It identifies a circular dependency in prior work: accurate transient detection needs a well-reconstructed static scene, but clean reconstruction itself requires reliable transient masks.
  • DualSplat resolves this by turning first-pass reconstruction failures into explicit priors for a second reconstruction stage, using those failures to build object-level pseudo-masks.
  • The method constructs pseudo-masks by combining photometric residuals, feature mismatches, and SAM2 instance boundaries, then refines them online with a lightweight MLP that shifts from prior supervision toward self-consistency.
  • Experiments on RobustNeRF and NeRF On-the-go show DualSplat outperforms baselines, with especially strong gains in scenes and regions heavy in transients.

Abstract

While 3D Gaussian Splatting (3DGS) achieves real-time photorealistic rendering, its performance degrades significantly when training images contain transient objects that violate multi-view consistency. Existing methods face a circular dependency: accurate transient detection requires a well-reconstructed static scene, while clean reconstruction itself depends on reliable transient masks. We address this challenge with DualSplat, a Failure-to-Prior framework that converts first-pass reconstruction failures into explicit priors for a second reconstruction stage. We observe that transients, which appear in only a subset of views, often manifest as incomplete fragments during conservative initial training. We exploit these failures to construct object-level pseudo-masks by combining photometric residuals, feature mismatches, and SAM2 instance boundaries. These pseudo-masks then guide a clean second-pass 3DGS optimization, while a lightweight MLP refines them online by gradually shifting from prior supervision to self-consistency. Experiments on RobustNeRF and NeRF On-the-go show that DualSplat outperforms existing baselines, demonstrating particularly clear advantages in transient-heavy scenes and transient regions.