Articulat3D: Reconstructing Articulated Digital Twins From Monocular Videos with Geometric and Motion Constraints
arXiv cs.CV / 3/13/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- Articulat3D introduces a framework to reconstruct articulated digital twins from monocular videos by jointly enforcing 3D geometric and motion constraints.
- It uses Motion Prior-Driven Initialization leveraging 3D point tracks to exploit the low-dimensional structure of articulated motion and decomposes the scene into multiple rigidly-moving groups via motion bases.
- It refines reconstructions with Geometric and Motion Constraints Refinement, employing learnable kinematic primitives parameterized by a joint axis, a pivot point, and per-frame motion scalars to ensure physical plausibility and temporal coherence.
- The approach achieves state-of-the-art performance on synthetic benchmarks and real-world monocular videos, significantly advancing digital twin feasibility under uncontrolled real-world conditions.
Related Articles

Interactive Web Visualization of GPT-2
Reddit r/artificial
Stop Treating AI Interview Fraud Like a Proctoring Problem
Dev.to
[R] Causal self-attention as a probabilistic model over embeddings
Reddit r/MachineLearning
The 5 software development trends that actually matter in 2026 (and what they mean for your startup)
Dev.to
InVideo AI Review: Fast Finished
Dev.to