Registration-Free Learnable Multi-View Capture of Faces in Dense Semantic Correspondence
arXiv cs.CV / 5/5/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper introduces MOCHI, a multi-view 3D face prediction framework that learns dense semantic correspondence without needing registered training data.
- MOCHI removes the dependency on slow manual registration by enforcing topological consistency using a pseudo-linear inverse kinematics solver, while semantic alignment is driven by dense keypoints from a 2D landmark predictor trained on synthetic data.
- The authors find that conventional point-to-surface distance losses can cause training instabilities and visual artifacts in registration-free settings, and they propose pointmap- and normal-based losses to improve gradient smoothness and reconstruction quality.
- A test-time optimization method further refines network weights for a few dozen iterations, improving accuracy and visual fidelity beyond purely feed-forward approaches.
- The authors report that MOCHI can outperform traditional labor-intensive registration pipelines and provide public code and models for reproducibility.
Related Articles

Singapore's Fraud Frontier: Why AI Scam Detection Demands Regulatory Precision
Dev.to

First experience with Building Apps with Google AI Studio: Incredibly simple and intuitive.
Dev.to

Meta will use AI to analyze height and bone structure to identify if users are underage
TechCrunch

13 CLAUDE.md Rules That Make AI Write Modern PHP (Not PHP 5 Resurrected)
Dev.to

Building an AI Image Generator SaaS in 2026: My Tech Stack and Lessons
Dev.to