Diffusion Models are Secretly Zero-Shot 3DGS Harmonizers

arXiv cs.CV / 5/4/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces D3DR, a method to insert a Gaussian Splatting (3DGS) parametrized object into an existing 3DGS scene while correcting lighting, shadows, and artifacts for visual consistency.
  • It argues that diffusion models trained on large real-world datasets implicitly learn correct scene illumination and uses this capability to guide lighting correction.
  • After insertion, the approach optimizes a diffusion-based Delta Denoising Score (DDS)-inspired objective to adjust the inserted object’s 3D Gaussian parameters.
  • The work proposes a diffusion “personalization” technique that maintains the object’s geometry and texture across varying lighting conditions to preserve identity matching.
  • Experiments show improved relighting quality versus prior methods, reporting about a 2.0 dB PSNR gain.

Abstract

Gaussian Splatting has become a popular technique for various 3D Computer Vision tasks, including novel view synthesis, scene reconstruction, and dynamic scene rendering. However, the challenge of natural-looking object insertion, where the object's appearance seamlessly matches the scene, remains unsolved. In this work, we propose a method, dubbed D3DR, for inserting a 3DGS-parametrized object into a 3DGS scene while correcting its lighting, shadows, and other visual artifacts to ensure consistency. We reveal a hidden ability of diffusion models trained on large real-world datasets to implicitly understand correct scene lighting, and leverage it in our pipeline. After inserting the object, we optimize a diffusion-based Delta Denoising Score (DDS)-inspired objective to adjust its 3D Gaussian parameters for proper lighting correction. We introduce a novel diffusion personalization technique that preserves object geometry and texture across diverse lighting conditions, and utilize it to achieve consistent identity matching between original and inserted objects. Finally, we demonstrate the effectiveness of the method by comparing it to existing approaches, achieving 2.0 dB PSNR improvements in relighting quality.