Sim-to-Real of Humanoid Locomotion Policies via Joint Torque Space Perturbation Injection

arXiv cs.RO / 2026/3/24

💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research

要点

  • The paper introduces a new sim-to-real training approach for humanoid locomotion policies by injecting state-dependent perturbations directly into simulated joint torque inputs.
  • Instead of relying on domain randomization over a fixed finite parameter set, the method targets a wider range of “reality gaps” without requiring extra training runs.
  • Neural networks are used as flexible perturbation generators to model complex, nonlinear and state-varying uncertainties such as actuator dynamics and contact compliance.
  • Experiments report improved robustness of the resulting policies against complex, previously unseen reality gaps in both simulation and real-world deployment.

Abstract

This paper proposes a novel alternative to existing sim-to-real methods for training control policies with simulated experiences. Unlike prior methods that typically rely on domain randomization over a fixed finite set of parameters, the proposed approach injects state-dependent perturbations into the input joint torque during forward simulation. These perturbations are designed to simulate a broader spectrum of reality gaps than standard parameter randomization without requiring additional training. By using neural networks as flexible perturbation generators, the proposed method can represent complex, state-dependent uncertainties, such as nonlinear actuator dynamics and contact compliance, that parametric randomization cannot capture. Experimental results demonstrate that the proposed approach enables humanoid locomotion policies to achieve superior robustness against complex, unseen reality gaps in both simulation and real-world deployment.