Which Reconstruction Model Should a Robot Use? Routing Image-to-3D Models for Cost-Aware Robotic Manipulation
arXiv cs.RO / 3/31/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses how robots should choose among multiple Image-to-3D reconstruction methods with different cost–quality tradeoffs for tasks requiring either fine detail or coarse, collision-safe geometry.
- It proposes SCOUT, a routing framework that separates reconstruction scoring into (1) viewpoint-dependent model performance modeled by a learned probability distribution and (2) an overall image difficulty estimate via a scalar partition function.
- SCOUT is designed so that view-invariant reconstruction pipelines can be added, removed, or reconfigured without retraining because the learned component only operates over viewpoint-dependent models.
- The framework supports arbitrary, multi-dimensional cost constraints at inference time, making it suitable for real robotic systems where compute, latency, and quality requirements vary.
- Experiments on several 3D reconstruction datasets and robotic grasping/dexterous manipulation show consistent improvements over routing baselines, and the authors release code and additional results.
Related Articles

Black Hat Asia
AI Business
[D] How does distributed proof of work computing handle the coordination needs of neural network training?
Reddit r/MachineLearning

Claude Code's Entire Source Code Was Just Leaked via npm Source Maps — Here's What's Inside
Dev.to

BYOK is not just a pricing model: why it changes AI product trust
Dev.to

AI Citation Registries and Identity Persistence Across Records
Dev.to