Human Pose Estimation in Trampoline Gymnastics: Improving Performance Using a New Synthetic Dataset
arXiv cs.CV / 4/3/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The study addresses poor human pose estimation performance in trampoline gymnastics, where athletes exhibit extreme poses and unusual multi-view viewpoints.
- Researchers introduce a new synthetic dataset, STP, generated from motion-capture trampoline routines by fitting noisy mocap to a parametric human model and rendering realistic multiview images.
- A ViTPose model is fine-tuned on STP, and the improved 2D keypoint accuracy carries over to better 3D pose reconstruction via triangulation.
- On challenging real multi-view trampoline images, the fine-tuned model achieves state-of-the-art 2D results and reduces 3D MPJPE by 12.5 mm (a 19.6% improvement over the pretrained ViTPose).
- The work narrows the performance gap between “common” pose scenarios and highly atypical gymnastics poses, demonstrating the value of synthetic data for domain-specific perception.




