AI Navigate

Catalyst4D: High-Fidelity 3D-to-4D Scene Editing via Dynamic Propagation

arXiv cs.CV / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Catalyst4D introduces a framework to transfer high-quality 3D edits to dynamic 4D Gaussian scenes while preserving spatial and temporal coherence.
  • It employs Anchor-based Motion Guidance (AMG) to build structurally stable, region-level anchors from original and edited Gaussians, enabling robust deformation propagation.
  • The correspondences among anchors are established via optimal transport to minimize cross-region interference and motion drift.
  • Color Uncertainty-guided Appearance Refinement (CUAR) estimates per-Gaussian color uncertainty to maintain temporal appearance consistency and selectively refine occlusion-prone regions.
  • Experimental results demonstrate temporally stable, high-fidelity dynamic scene editing and superiority over existing methods in visual quality and motion coherence.

Abstract

Recent advances in 3D scene editing using NeRF and 3DGS enable high-quality static scene editing. In contrast, dynamic scene editing remains challenging, as methods that directly extend 2D diffusion models to 4D often produce motion artifacts, temporal flickering, and inconsistent style propagation. We introduce Catalyst4D, a framework that transfers high-quality 3D edits to dynamic 4D Gaussian scenes while maintaining spatial and temporal coherence. At its core, Anchor-based Motion Guidance (AMG) builds a set of structurally stable and spatially representative anchors from both original and edited Gaussians. These anchors serve as robust region-level references, and their correspondences are established via optimal transport to enable consistent deformation propagation without cross-region interference or motion drift. Complementarily, Color Uncertainty-guided Appearance Refinement (CUAR) preserves temporal appearance consistency by estimating per-Gaussian color uncertainty and selectively refining regions prone to occlusion-induced artifacts. Extensive experiments demonstrate that Catalyst4D achieves temporally stable, high-fidelity dynamic scene editing and outperforms existing methods in both visual quality and motion coherence.