Iteratively Learning Muscle Memory for Legged Robots to Master Adaptive and High Precision Locomotion

arXiv cs.RO / 4/10/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a scalable, adaptive locomotion control framework that combines Iterative Learning Control (ILC) with a biologically inspired “torque library” to mimic muscle memory in legged robots.
  • By using learned torque profiles, the approach improves trajectory tracking accuracy under unmodeled dynamics and external disturbances, and extends ILC beyond periodic gaits to handle nonperiodic tasks.
  • The method is data-enabled: it couples a physics-based hybrid-system model (from trajectory optimization) with real-time learning to compensate for model errors and environmental changes.
  • A key contribution is a generalized torque library that can rapidly adapt to variations in speed, terrain, and gravity without needing to redo learning, reducing online computation during execution.
  • Experiments and simulations on Cassie (biped) and A1 (quadruped) show up to 85% reduction in joint tracking error within seconds, reliable slope/terrain adaptation, and control update rates over 30× faster than prior whole-body controller approaches.

Abstract

This paper presents a scalable and adaptive control framework for legged robots that integrates Iterative Learning Control (ILC) with a biologically inspired torque library (TL), analogous to muscle memory. The proposed method addresses key challenges in robotic locomotion, including accurate trajectory tracking under unmodeled dynamics and external disturbances. By leveraging the repetitive nature of periodic gaits and extending ILC to nonperiodic tasks, the framework enhances accuracy and generalization across diverse locomotion scenarios. The control architecture is data-enabled, combining a physics-based model derived from hybrid-system trajectory optimization with real-time learning to compensate for model uncertainties and external disturbances. A central contribution is the development of a generalized TL that stores learned control profiles and enables rapid adaptation to changes in speed, terrain, and gravitational conditions-eliminating the need for repeated learning and significantly reducing online computation. The approach is validated on the bipedal robot Cassie and the quadrupedal robot A1 through extensive simulations and hardware experiments. Results demonstrate that the proposed framework reduces joint tracking errors by up to 85% within a few seconds and enables reliable execution of both periodic and nonperiodic gaits, including slope traversal and terrain adaptation. Compared to state-of-the-art whole-body controllers, the learned skills eliminate the need for online computation during execution and achieve control update rates exceeding 30x those of existing methods. These findings highlight the effectiveness of integrating ILC with torque memory as a highly data-efficient and practical solution for legged locomotion in unstructured and dynamic environments.