Learn Weightlessness: Imitate Non-Self-Stabilizing Motions on Humanoid Robot

arXiv cs.RO / 4/24/2026

💬 OpinionDeveloper Stack & InfrastructureIndustry & Market MovesModels & Research

Key Points

  • The paper addresses a gap in humanoid whole-body control by targeting environment-dependent, non-self-stabilizing (NSS) motions that require interacting with the surroundings.
  • It proposes a biologically inspired “weightlessness” mechanism where specific joints are selectively relaxed so the robot can use passive contact to stabilize itself and complete the intended motion.
  • The authors introduce a weightlessness-state auto-labeling strategy for dataset annotation and a Weightlessness Mechanism (WM) that automatically decides which joints to relax and to what extent during execution.
  • Experiments across three tasks (chair sitting at different heights, bed lying with different inclinations, and wall leaning using shoulder or elbow) show strong generalization in both simulation and on the Unitree G1 robot, using only single-action demonstrations with no task-specific tuning.
  • Overall, the work aims to bridge rigid trajectory tracking and adaptive environmental interaction, improving stability and contact-rich behavior in humanoid robots.

Abstract

The integration of imitation and reinforcement learning has enabled remarkable advances in humanoid whole-body control, facilitating diverse human-like behaviors. However, research on environment-dependent motions remains limited. Existing methods typically enforce rigid trajectory tracking while neglecting physical interactions with the environment. We observe that humans naturally exploit a "weightless" state during non-self-stabilizing (NSS) motions--selectively relaxing specific joints to allow passive body--environment contact, thereby stabilizing the body and completing the motion. Inspired by this biological mechanism, we design a weightlessness-state auto-labeling strategy for dataset annotation; and we propose the Weightlessness Mechanism (WM), a method that dynamically determines which joints to relax and to what level, together enabling effective environmental interaction while executing target motions. We evaluate our approach on 3 representative NSS tasks: sitting on chairs of varying heights, lying down on beds with different inclinations, and leaning against walls via shoulder or elbow. Extensive experiments in simulation and on the Unitree G1 robot demonstrate that our WM method, trained on single-action demonstrations without any task-specific tuning, achieves strong generalization across diverse environmental configurations while maintaining motion stability. Our work bridges the gap between precise trajectory tracking and adaptive environmental interaction, offering a biologically-inspired solution for contact-rich humanoid control.