AI Navigate

Need for Speed: Zero-Shot Depth Completion with Single-Step Diffusion

arXiv cs.CV / 3/12/2026

📰 NewsModels & Research

Key Points

  • Marigold-SSD is a single-step, late-fusion depth completion framework that leverages diffusion priors while eliminating costly test-time optimization typical of diffusion-based methods.
  • The approach shifts computational burden from inference to finetuning, enabling efficient and robust 3D perception under real-world latency constraints.
  • It achieves significantly faster inference with a training cost of only 4.5 GPU days.
  • The method is evaluated across four indoor and two outdoor benchmarks, showing strong cross-domain generalization and zero-shot performance compared to existing depth completion approaches.
  • The work also analyzes performance under varying input sparsity levels, challenging common evaluation protocols and highlighting efficiency gains over discriminative models.

Abstract

We introduce Marigold-SSD, a single-step, late-fusion depth completion framework that leverages strong diffusion priors while eliminating the costly test-time optimization typically associated with diffusion-based methods. By shifting computational burden from inference to finetuning, our approach enables efficient and robust 3D perception under real-world latency constraints. Marigold-SSD achieves significantly faster inference with a training cost of only 4.5 GPU days. We evaluate our method across four indoor and two outdoor benchmarks, demonstrating strong cross-domain generalization and zero-shot performance compared to existing depth completion approaches. Our approach significantly narrows the efficiency gap between diffusion-based and discriminative models. Finally, we challenge common evaluation protocols by analyzing performance under varying input sparsity levels. Page: https://dtu-pas.github.io/marigold-ssd/