Residual Gaussian Splatting for Ultra Sparse-View CBCT Reconstruction

arXiv cs.CV / 5/1/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper addresses an issue in cone-beam CT reconstruction using 3D Gaussian splatting: under ultra sparse-view data, photometric optimization shows spectral bias that causes over-smoothing and loss of high-frequency anatomical details.
  • It proposes Residual Gaussian Splatting (RGS), combining wavelet multi-resolution analysis with 3DGS while handling the mismatch between physical non-negativity of X-ray attenuation and the bipolar nature of wavelet coefficients.
  • RGS uses a spectrally decoupled Gaussian representation that decomposes the volumetric field into a geometric base component and a residual detail component to turn high-frequency fitting into physically consistent residual compensation.
  • The method includes a spectral-spatial collaborative optimization strategy to coordinate geometric anchoring and texture refinement while preventing spectral cross-talk.
  • Experiments on clinical datasets show RGS improves visual fidelity, better preserving details in complex trabecular and vascular structures while suppressing artifacts compared with existing neural rendering baselines.

Abstract

While 3D Gaussian splatting (3DGS) offers explicit and efficient scene representations for cone-beam computed tomography reconstruction, conventional photometric optimization inherently suffers from spectral bias under ultra sparse-view conditions, leading to over-smoothing and a loss of high-frequency anatomical details. Since wavelet transforms provide rich high-frequency information and have been widely utilized to enhance sparse reconstruction, this work integrates wavelet multi-resolution analysis with 3DGS. To circumvent the mathematical mismatch between the strict non-negativity of physical X-ray attenuation and the bipolar nature of high-frequency wavelet coefficients, we propose Residual Gaussian Splatting (RGS). Methodologically, we introduce a spectrally-decoupled Gaussian representation that stratifies the volumetric field into a geometric base component and a residual detail component. This decomposition systematically transforms explicit high-frequency fitting into a physically consistent, implicit residual compensation task. Furthermore, we devise a spectral-spatial collaborative optimization strategy to coordinate the interplay between geometric anchoring and texture refinement, effectively preventing spectral crosstalk. Extensive experiments on clinical datasets demonstrate that RGS enables the reconstructed images to capture highly refined geometric textures. It successfully resolves the trade-off between artifact suppression and detail preservation, yielding superior visual fidelity in complex trabecular and vascular structures compared to existing neural rendering baselines.