2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction

arXiv cs.CV / 3/23/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • 2K Retrofit introduces a framework for efficient 2K-resolution inference on geometric foundation models without modifying or retraining the backbone.
  • The approach combines fast coarse predictions with entropy-based sparse refinement to selectively improve high-uncertainty regions, achieving high fidelity with minimal overhead.
  • Extensive experiments show state-of-the-art accuracy and speed on standard benchmarks, enabling scalable deployment in high-resolution 3D vision tasks for autonomous driving, robotics, and AR/MR.
  • Code will be released upon acceptance, signaling practical availability for researchers and practitioners.

Abstract

High-resolution geometric prediction is essential for robust perception in autonomous driving, robotics, and AR/MR, but current foundation models are fundamentally limited by their scalability to real-world, high-resolution scenarios. Direct inference on 2K images with these models incurs prohibitive computational and memory demands, making practical deployment challenging. To tackle the issue, we present 2K Retrofit, a novel framework that enables efficient 2K-resolution inference for any geometric foundation model, without modifying or retraining the backbone. Our approach leverages fast coarse predictions and an entropy-based sparse refinement to selectively enhance high-uncertainty regions, achieving precise and high-fidelity 2K outputs with minimal overhead. Extensive experiments on widely used benchmark demonstrate that 2K Retrofit consistently achieves state-of-the-art accuracy and speed, bridging the gap between research advances and scalable deployment in high-resolution 3D vision applications. Code will be released upon acceptance.