OpenT2M: No-frill Motion Generation with Open-source, Large-scale, High-quality Data

arXiv cs.CV / 3/20/2026

Key Points

  • OpenT2M introduces a million-level, high-quality open-source motion dataset with over 2800 hours of human motion to improve generalization in text-to-motion models.
  • Each sequence passes rigorous quality control through physical feasibility validation and multi-granularity filtering, and carries detailed second-wise text annotations.
  • A new pretrained motion model, MonoFrill, is built around 2D-PRQ, a novel motion tokenizer that divides the body into biological parts to capture spatiotemporal dependencies. The authors report superior reconstruction and strong zero-shot performance for 2D-PRQ.
  • The authors provide an automated pipeline for long-horizon motion generation and expect OpenT2M and MonoFrill to advance T2M benchmarking and data-quality standards.

Abstract

Text-to-motion (T2M) generation aims to create realistic human movements from text descriptions, with promising applications in animation and robotics. Despite recent progress, current T2M models perform poorly on unseen text descriptions due to the small scale and limited diversity of existing motion datasets. To address this problem, we introduce OpenT2M, a million-level, high-quality, and open-source motion dataset containing over 2800 hours of human motion. Each sequence undergoes rigorous quality control through physical feasibility validation and multi-granularity filtering, and is paired with detailed second-wise text annotations. We also develop an automated pipeline for creating long-horizon sequences, enabling complex motion generation. Building upon OpenT2M, we introduce MonoFrill, a pretrained motion model that achieves compelling T2M results without complicated designs or technical tricks ("frills"). Its core component is 2D-PRQ, a novel motion tokenizer that captures spatiotemporal dependencies by dividing the human body into biological parts. Experiments show that OpenT2M significantly improves the generalization of existing T2M models, while 2D-PRQ achieves superior reconstruction and strong zero-shot performance. We expect that OpenT2M and MonoFrill will advance the T2M field by addressing longstanding data quality and benchmarking challenges.
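
The abstract does not spell out how 2D-PRQ works, so the following is only a minimal sketch of the general idea it names: split each frame into body parts and tokenize each part separately, producing a grid of tokens indexed by frame and part. It assumes, without confirmation from the source, that each part is encoded by residual quantization (successive codebooks each encode what the previous level left over). The part grouping `BODY_PARTS`, the toy codebooks, and all function names are hypothetical illustrations, not the authors' implementation.

```python
from typing import Dict, List, Sequence, Tuple

Vector = List[float]
Codebook = Sequence[Sequence[float]]

# Hypothetical grouping of per-frame feature indices into body parts.
# The paper's actual partition is not given in the abstract.
BODY_PARTS: Dict[str, List[int]] = {
    "torso": [0, 1],
    "left_arm": [2, 3],
    "right_arm": [4, 5],
    "left_leg": [6, 7],
    "right_leg": [8, 9],
}

# Toy codebooks (assumed): a coarse level followed by a fine level.
COARSE: Codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, -1.0]]
FINE: Codebook = [[0.0, 0.0], [0.25, 0.0], [0.0, 0.25], [-0.25, 0.0], [0.0, -0.25]]


def nearest(codebook: Codebook, vec: Sequence[float]) -> int:
    """Return the index of the codebook entry closest to vec (squared L2)."""
    def dist(i: int) -> float:
        return sum((a - b) ** 2 for a, b in zip(codebook[i], vec))
    return min(range(len(codebook)), key=dist)


def residual_quantize(vec: Sequence[float],
                      codebooks: Sequence[Codebook]) -> Tuple[List[int], Vector]:
    """Quantize vec level by level; each level encodes the remaining residual."""
    residual = list(vec)
    codes: List[int] = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes, residual


def dequantize(codes: Sequence[int], codebooks: Sequence[Codebook]) -> Vector:
    """Reconstruct a vector by summing the selected entry from each level."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


def tokenize_motion(motion: Sequence[Sequence[float]],
                    codebooks: Sequence[Codebook]) -> List[Dict[str, List[int]]]:
    """Build a frame-by-part token grid: one code list per part per frame."""
    grid = []
    for frame in motion:
        tokens = {}
        for part, indices in BODY_PARTS.items():
            part_vec = [frame[i] for i in indices]
            tokens[part], _ = residual_quantize(part_vec, codebooks)
        grid.append(tokens)
    return grid


if __name__ == "__main__":
    # Two frames of 10 features each; only the torso moves in frame 2.
    motion = [[0.0] * 10, [1.25, 1.0] + [0.0] * 8]
    for t, tokens in enumerate(tokenize_motion(motion, [COARSE, FINE])):
        print(t, tokens)
```

In this sketch all parts share the same two codebooks for brevity; a real tokenizer would learn its codebooks from data and might use separate ones per part.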