AI Navigate

Inst4DGS: Instance-Decomposed 4D Gaussian Splatting with Multi-Video Label Permutation Learning

arXiv cs.CV / 3/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • Inst4DGS introduces an instance-decomposed 4D Gaussian Splatting framework with long-horizon per-Gaussian trajectories for dynamic scenes.
  • It tackles inconsistent instance labels across independently segmented multi-view videos by learning cross-video matches with per-video label-permutation latents and a differentiable Sinkhorn layer, enabling consistent identity preservation.
  • The approach uses instance-decomposed motion scaffolds that provide low-dimensional motion bases per object to support long-horizon trajectory optimization.
  • Experimental results on the Panoptic Studio and Neural3DV datasets show state-of-the-art rendering and segmentation quality, including PSNR improvements from 26.10 to 28.36 and instance mIoU from 0.6310 to 0.9129.

Abstract

We present Inst4DGS, an instance-decomposed 4D Gaussian Splatting (4DGS) approach with long-horizon per-Gaussian trajectories. While dynamic 4DGS has advanced rapidly, instance-decomposed 4DGS remains underexplored, largely due to the difficulty of associating inconsistent instance labels across independently segmented multi-view videos. We address this challenge by introducing per-video label-permutation latents that learn cross-video instance matches through a differentiable Sinkhorn layer, enabling direct multi-view supervision with consistent identity preservation. This explicit label alignment yields sharp decision boundaries and temporally stable identities without identity drift. To further improve efficiency, we propose instance-decomposed motion scaffolds that provide low-dimensional motion bases per object for long-horizon trajectory optimization. Experiments on Panoptic Studio and Neural3DV show that Inst4DGS jointly supports tracking and instance decomposition while achieving state-of-the-art rendering and segmentation quality. On the Panoptic Studio dataset, Inst4DGS improves PSNR from 26.10 to 28.36, and instance mIoU from 0.6310 to 0.9129, over the strongest baseline.