GHOST: Fast Category-agnostic Hand-Object Interaction Reconstruction from RGB Videos using Gaussian Splatting
arXiv cs.CV / 3/20/2026
Key Points
- GHOST is a fast, category-agnostic framework for reconstructing dynamic hand-object interactions from monocular RGB videos. It represents both hands and objects as dense 2D Gaussian discs rendered with Gaussian Splatting.
- It introduces three innovations: geometric-prior retrieval with a consistency loss to complete occluded object regions; grasp-aware alignment that refines hand translations and object scale for realistic contact; and a hand-aware background loss to avoid penalizing hand-occluded object regions.
- It achieves complete, physically consistent, and animatable reconstructions and runs an order of magnitude faster than prior category-agnostic methods, with state-of-the-art 3D reconstruction and 2D rendering quality on ARCTIC, HO3D, and in-the-wild datasets.
- Code is publicly available on GitHub.
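
To make the hand-aware background loss from the key points more concrete, here is a minimal sketch of the general idea, not the authors' implementation. It assumes the loss penalizes rendered object opacity at pixels outside the object mask, and simply skips pixels covered by the hand mask, since the object may legitimately lie behind the hand there. The function name, the mask inputs, and the mean-opacity form of the penalty are all assumptions made for illustration.

```python
def hand_aware_background_loss(rendered_alpha, object_mask, hand_mask):
    """Mean rendered object opacity over pixels that are neither object nor hand.

    Hypothetical formulation: all three inputs are 2D lists of equal shape.
    rendered_alpha holds the object's rendered opacity in [0, 1];
    object_mask and hand_mask hold 0/1 segmentation labels.
    """
    total = 0.0
    count = 0
    for alpha_row, obj_row, hand_row in zip(rendered_alpha, object_mask, hand_mask):
        for alpha, is_obj, is_hand in zip(alpha_row, obj_row, hand_row):
            # Skip visible object pixels and hand pixels: the object may be
            # hidden behind the hand, so opacity there is not penalized.
            if is_obj or is_hand:
                continue
            total += alpha
            count += 1
    return total / count if count else 0.0


# Toy 2x3 frame: column 0 is visible object, column 1 is hand, column 2 is background.
rendered_alpha = [[0.9, 0.8, 0.1],
                  [0.7, 0.6, 0.0]]
object_mask = [[1, 0, 0],
               [1, 0, 0]]
hand_mask = [[0, 1, 0],
             [0, 1, 0]]
no_hand_mask = [[0, 0, 0],
                [0, 0, 0]]

hand_aware = hand_aware_background_loss(rendered_alpha, object_mask, hand_mask)
naive = hand_aware_background_loss(rendered_alpha, object_mask, no_hand_mask)
print(hand_aware, naive)
```

In this toy frame the naive variant, which ignores the hand mask, yields a larger loss because it also penalizes the opacity rendered behind the hand. That is the effect the third innovation is described as avoiding.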