Faster 3D Gaussian Splatting Convergence via Structure-Aware Densification

arXiv cs.CV / 5/1/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • 3D Gaussian Splatting’s standard adaptive densification can confuse true geometric errors with frequency aliasing, causing either over-blurring or excessive, inefficient densification.
  • The paper proposes a structure-aware densification method that compares a Gaussian’s projected screen-space extent with local texture structure to decide when to subdivide.
  • It introduces a multi-scale frequency analysis (using structure tensors plus Laplacian scale space) to estimate dominant per-pixel frequency, then defines a per-Gaussian, per-axis “frequency violation” metric (η) to detect under-resolved texture details.
  • Instead of isotropic splitting, it performs anisotropic splitting based on which axes have high η, and adds a multiview consistency check to aggregate evidence across viewpoints.
  • By densifying earlier and more effectively, the method skips slow iterative densification cycles and achieves faster convergence and better reconstruction quality, especially for high-frequency regions.

Abstract

3D Gaussian Splatting has emerged as a powerful scene representation for real-time novel-view synthesis. However, its standard adaptive density control relies on screen-space positional gradients, which do not distinguish between geometric misplacement and frequency aliasing, often leading to either over-blurred high-frequency textures or inefficient over-densification. We present a structure-aware densification framework. Our key insight is that the decision to subdivide a Gaussian should be driven by an explicit comparison between its projected screen-space extent and the local structure of the texture it seeks to represent. We introduce a multi-scale frequency analysis combining structure tensors with Laplacian scale space analysis to estimate the dominant frequency at each pixel, enabling robust supervision across varying texture scales. Based on this analysis, we define \eta, a per-Gaussian, per-axis frequency violation metric that indicates when a primitive may be under-resolving local texture details. Unlike methods that perform isotropic splitting (e.g., splitting each Gaussian into two smaller ones with uniform shape), our approach performs anisotropic splitting. For each axis with high \eta, we compute a split factor to better resolve the local frequency content. We further introduce a multiview consistency criterion that aggregates \eta observations across multiple views. By performing densification early and faster, we skip the lengthy iterative densification phases required by baseline methods and achieve significantly faster convergence. Experiments on standard benchmarks demonstrate that our method also achieves superior reconstruction quality, particularly in high-frequency regions.