InCaRPose: In-Cabin Relative Camera Pose Estimation Model and Dataset
arXiv cs.CV / 4/7/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- InCaRPose introduces a Transformer-based model for robust relative camera pose estimation aimed at in-cabin automotive monitoring and camera extrinsic calibration under severe distortion (e.g., fisheye lenses).
- The method uses frozen backbone features (DINOv3) with a Transformer decoder to infer geometric relationships between reference and target images in a single inference step, including absolute metric-scale translation within realistic mount-adjustment bounds.
- To handle highly distorted automotive interiors, the approach is trained exclusively on synthetic data and is designed to generalize to real-world cabins without requiring identical camera intrinsics.
- The paper reports competitive results on the public 7-Scenes dataset and maintains high rotation/translation precision even with a ViT-Small backbone, targeting real-time use cases such as driver monitoring in (supervised) autonomous driving.
- Alongside the model, the authors release the In-Cabin-Pose dataset of highly distorted vehicle-interior images and provide code on GitHub.
Related Articles

Black Hat Asia
AI Business
v0.20.5
Ollama Releases

Inside Anthropic's Project Glasswing: The AI Model That Found Zero-Days in Every Major OS
Dev.to
Gemma 4 26B fabricated an entire code audit. I have the forensic evidence from the database.
Reddit r/LocalLLaMA
SoloEngine: Low-Code Agentic AI Development Platform with Native Support for Multi-Agent Collaboration, MCP, and Skill System
Dev.to