CAM3R: Camera-Agnostic Model for 3D Reconstruction

arXiv cs.CV / 3/25/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • CAM3R is proposed as a camera-agnostic, feed-forward model that reconstructs dense 3D geometry from unposed wide-angle images (e.g., fisheye/panoramic) without requiring prior camera calibration.
  • The approach uses a two-view network with a Ray Module to estimate per-pixel ray directions and a Cross-view Module to predict radial distances along with confidence maps, pointmaps, and relative poses.
  • A Ray-Aware Global Alignment framework is introduced to refine pose and optimize scale while preserving the model’s predicted local geometry consistency.
  • Experiments across multiple camera-model datasets show improved pose estimation and 3D reconstruction performance, claiming a new state of the art.

Abstract

Recovering dense 3D geometry from unposed images remains a foundational challenge in computer vision. Current state-of-the-art models are predominantly trained on perspective datasets, which implicitly constrains them to a standard pinhole camera geometry. As a result, these models suffer from significant geometric degradation when applied to wide-angle imagery captured via non-rectilinear optics, such as fisheye or panoramic sensors. To address this, we present CAM3R, a Camera-Agnostic, feed-forward Model for 3D Reconstruction capable of processing images from wide-angle camera models without prior calibration. Our framework consists of a two-view network which is bifurcated into a Ray Module (RM) to estimate per-pixel ray directions and a Cross-view Module (CVM) to infer radial distance with confidence maps, pointmaps, and relative poses. To unify these pairwise predictions into a consistent 3D scene, we introduce a Ray-Aware Global Alignment framework for pose refinement and scale optimization while strictly preserving the predicted local geometry. Extensive experiments on various camera model datasets, including panorama, fisheye and pinhole imagery, demonstrate that CAM3R establishes a new state-of-the-art in pose estimation and reconstruction.