Articulat3D: Reconstructing Articulated Digital Twins From Monocular Videos with Geometric and Motion Constraints
arXiv cs.CV / 3/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Articulat3D introduces a framework to reconstruct articulated digital twins from monocular videos by jointly enforcing 3D geometric and motion constraints.
- It uses Motion Prior-Driven Initialization leveraging 3D point tracks to exploit the low-dimensional structure of articulated motion and decomposes the scene into multiple rigidly-moving groups via motion bases.
- It refines reconstructions with Geometric and Motion Constraints Refinement, employing learnable kinematic primitives parameterized by a joint axis, a pivot point, and per-frame motion scalars to ensure physical plausibility and temporal coherence.
- The approach achieves state-of-the-art performance on synthetic benchmarks and real-world monocular videos, significantly advancing digital twin feasibility under uncontrolled real-world conditions.




