A Dataset and Evaluation for Complex 4D Markerless Human Motion Capture
arXiv cs.CV / 4/15/2026
📰 NewsSignals & Early TrendsModels & Research
Key Points
- The paper introduces a new dataset and evaluation benchmark for complex 4D markerless human motion capture, designed to better reflect real-world challenges like multi-person interactions and heavy occlusions.
- The dataset includes synchronized multi-view RGB and depth sequences with accurate camera calibration, ground-truth 3D motion from a Vicon system, and corresponding SMPL/SMPL-X parameters for tightly aligned supervision.
- It covers both single- and multi-person scenarios featuring intricate motions, rapid position exchanges between similarly dressed subjects, varying subject distances, and frequent inter-person occlusions.
- Benchmark results show that current state-of-the-art markerless 4D MoCap models experience substantial performance degradation when tested under these realistic conditions, revealing a persistent domain gap.
- The authors report that targeted fine-tuning can improve generalization, suggesting the dataset is effective for driving more robust model development.
Related Articles

Black Hat Asia
AI Business
Are gamers being used as free labeling labor? The rise of "Simulators" that look like AI training grounds [D]
Reddit r/MachineLearning

I built a trading intelligence MCP server in 2 days — here's how
Dev.to

Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to
Qwen3.5-35B running well on RTX4060 Ti 16GB at 60 tok/s
Reddit r/LocalLLaMA