asRoBallet: Closing the Sim2Real Gap via Friction-Aware Reinforcement Learning for Underactuated Spherical Dynamics

arXiv cs.RO / 4/29/2026

📰 NewsDeveloper Stack & InfrastructureTools & Practical UsageModels & Research

Key Points

  • The paper introduces asRoBallet, claiming the first successful reinforcement learning deployment on humanoid ballbot hardware, targeting the long-standing Sim2Real “reality gap” from imperfect friction and contact modeling.
  • It argues that prior work enabled 3D balancing with LQR/MPC, but RL-to-hardware is blocked by missing contact dynamics, actuator latency/jitter, and the need for safe exploration on real systems.
  • To address this, the authors build a high-fidelity MuJoCo simulation that explicitly models ETH-type omni-wheel roller mechanics, including parasitic vibrations and contact discontinuities.
  • They develop a Friction-Aware RL framework designed to achieve zero-shot Sim2Real transfer by learning coupled friction effects across wheel–sphere and sphere–ground interfaces.
  • The system is also supported by a cost-conscious robot design (subtractive reconfiguration from an overconstrained quadruped) and an iOS-based low-latency control ecosystem for intuitive operator-driven expressive maneuvers.

Abstract

We introduce asRoBallet, to the best of our knowledge, the first successful deployment of reinforcement learning (RL) on a humanoid ballbot hardware. Historically, ballbots have served as a canonical benchmark for underactuated and nonholonomic control, which are characterized by a reality gap in complex friction models for wheel-sphere-ground interactions. While current literature demonstrates successful handling of 3D balancing with LQR and MPC, transitioning to actual hardware for a humanoid ballbot using RL is currently hindered by critical gaps in contact modeling, actuator latency & jitter, and safe hardware exploration, and safe hardware exploration. This study proposes a high-fidelity MuJoCo simulation that explicitly models the discrete roller mechanics of ETH-type omni-wheels, thereby capturing parasitic vibrations and contact discontinuities that are previously ignored. We also developed a Friction-Aware Reinforcement Learning framework that achieves zero-shot Sim2Real transfer by mastering the coupled rolling, lateral, and torsional friction channels at the wheel-sphere and sphere-ground interfaces. We designed asRoBallet through subtractive reconfiguration, repurposing key components from an overconstrained quadruped and integrating them into a newly designed structural frame to achieve a robust research platform at low cost. We also developed a generalized iOS ecosystem that transforms consumer electronics into a low-latency interface, enabling a single operator to orchestrate expressive humanoid maneuvers via intuitive natural motion.