VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation
arXiv cs.CV / 3/20/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- VGGT-360 is a training-free, zero-shot framework for panoramic depth estimation that reformulates the task as panorama-to-3D-to-depth using multi-view reconstructed 3D models and VGGT-style foundation models.
- It introduces three plug-and-play modules: (i) uncertainty-guided adaptive projection slices to convert panoramas into perspective views and allocate more views to geometry-poor regions, (ii) structure-saliency enhanced attention to improve 3D reconstruction robustness and cross-view coherence, and (iii) correlation-weighted 3D model correction to reweight overlapping points based on attention-derived correlations for consistent geometry.
- The approach unifies fragmented per-view reasoning into a coherent panoramic understanding by leveraging intrinsic 3D consistency and bridging domain gaps between panoramic inputs and perspective priors.
- Extensive experiments show VGGT-360 outperforms both trained and training-free state-of-the-art methods across multiple resolutions and diverse indoor and outdoor datasets.
Related Articles

Interactive Web Visualization of GPT-2
Reddit r/artificial
Stop Treating AI Interview Fraud Like a Proctoring Problem
Dev.to
[R] Causal self-attention as a probabilistic model over embeddings
Reddit r/MachineLearning
The 5 software development trends that actually matter in 2026 (and what they mean for your startup)
Dev.to
InVideo AI Review: Fast Finished
Dev.to