Rendering Multi-Human and Multi-Object with 3D Gaussian Splatting
arXiv cs.CV / 4/6/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper addresses “Multi-Human Multi-Object” (MHMO) rendering, aiming to reconstruct dynamic scenes with multiple interacting people and objects from sparse-view inputs for applications like robotics and VR/AR digital twins.
- It identifies two core challenges: maintaining view-consistent representations for each instance under heavy mutual occlusion, and explicitly modeling combinatorial dependencies created by inter-instance interactions.
- To tackle this, the authors propose MM-GS, a hierarchical framework based on 3D Gaussian Splatting with a per-instance multi-view fusion step for consistent instance representations.
- MM-GS also introduces a scene-level instance interaction module that uses a global scene graph to reason about relationships and refine instance attributes to better capture subtle contact and interaction effects.
- Experiments on challenging datasets show the method achieves state-of-the-art performance, improving over strong baselines with higher-fidelity details and more plausible inter-instance contacts.
Related Articles

Black Hat Asia
AI Business
How Bash Command Safety Analysis Works in AI Systems
Dev.to
How I Built an AI Agent That Earns USDC While I Sleep — A Complete Guide
Dev.to
How to Get Better Output from AI Tools (Without Burning Time and Tokens)
Dev.to
How I Added LangChain4j Without Letting It Take Over My Spring Boot App
Dev.to