XRZero-G0: Pushing the Frontier of Dexterous Robotic Manipulation with Interfaces, Quality and Ratios

arXiv cs.RO / 4/15/2026

💬 OpinionSignals & Early TrendsModels & Research

Key Points

  • The paper introduces XRZero-G0, a hardware–software co-designed VR system aimed at scaling dexterous robotic manipulation by collecting high-quality, action-aligned demonstration data without robot dependency.
  • It proposes a closed-loop data pipeline (collection, inspection, training, evaluation) that improves reliability for non-proprioceptive demonstrations and reports an 85% data validity rate with an explicit quality-control mechanism.
  • The authors analyze how robot-free demonstration data scales and identify empirically effective data-mixing ratios, finding that a small amount of real-robot data (e.g., 10:1 robot-free to real-robot) can match performance of purely real-robot datasets.
  • XRZero-G0 reduces data acquisition costs by about twentyfold and uses a 2,000-hour robot-free dataset to achieve zero-shot cross-embodiment transfer to a target physical robot.
  • A public repository is provided, supporting reuse of the system and workflow for embodied data collection and policy-learning research.

Abstract

The acquisition of high-quality, action-aligned demonstration data remains a fundamental bottleneck in scaling foundation models for dexterous robot manipulation. Although robot-free human demonstrations (e.g., the UMI paradigm) offer a scalable alternative to traditional teleoperation, current systems are constrained by sub-optimal hardware ergonomics, open-loop workflows, and a lack of systematic data-mixing strategies. To address these limitations, we present XRZero-G0, a hardware-software co-designed system for embodied data collection and policy learning. The system features an ergonomic, virtual reality interface equipped with a top-view camera and dual specialized grippers to directly improve collection efficiency. To ensure dataset reliability, we propose a closed-loop collection, inspection, training, and evaluation pipeline for non-proprioceptive data. This workflow achieves an 85% data validity rate and establishes a transparent mechanism for quality control. Furthermore, we investigate the empirical scaling behaviors and optimal mixing ratios of robot-free data. Extensive experiments indicate that combining a minimal volume of real-robot data with large-scale robot-free data (e.g., a 10:1 ratio) achieves performance comparable to exclusively real-robot datasets, while reducing acquisition costs by a factor of twenty. Utilizing XRZero-G0, we construct a 2,000-hour robot-free dataset that enables zero-shot cross-embodiment transfer to a target physical robot, demonstrating a highly scalable methodology for generalized real-world manipulation.Our project repository: https://github.com/X-Square-Robot/XRZero-G0