A Physical Imitation Learning Pipeline for Energy-Efficient Quadruped Locomotion Assisted by Parallel Elastic Joint

arXiv cs.RO / 4/2/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a Physical Imitation Learning (PIL) pipeline that distills a reinforcement learning quadruped control policy into physically realizable joint responses that can be offloaded to passive Parallel Elastic Joints (PEJs).
  • It uses residual motor commands to recover RL performance while outsourcing a substantial portion of mechanical power to the passive elastic elements.
  • Simulation results indicate large energy savings, with up to 87% mechanical power offloaded to PEJs on flat terrain and 18% on rough terrain.
  • The approach performs “brain-body co-design” by distilling from a pre-existing control policy rather than jointly optimizing body design parameters, aiming to reduce computational search complexity.
  • The authors argue this computationally efficient, task-specific Embodied Physical Intelligence method can generalize to other joint-based robot morphologies.

Abstract

Due to brain-body co-evolution, animals' intrinsic body dynamics play a crucial role in energy-efficient locomotion, which shares control effort between active muscles and passive body dynamics -- a principle known as Embodied Physical Intelligence. In contrast, robot bodies are often designed with one centralised controller that typically suppress the intrinsic body dynamics instead of exploiting it. We introduce Physical Imitation Learning (PIL), which distils a Reinforcement Learning (RL) control policy into physically implementable body responses that can be directly offloaded to passive Parallel Elastic Joints (PEJs), enabling therefore the body to imitate part of the controlled behaviour. Meanwhile, the residual policy commands the motors to recover the RL policy's performance. The results is an overall reduced energy consumption thanks to outsourcing parts of the control policy to the PEJs. Here we show in simulated quadrupeds, that our PIL approach can offloads up to 87% of mechanical power to PEJs on flat terrain and 18% on rough terrain. Because the body design is distilled from -- rather than jointly optimised with -- the control policy, PIL realises brain-body co-design without expanding the search space with body design parameters, providing a computationally efficient route to task-specific Embodied Physical Intelligence applicable to a wide range of joint-based robot morphologies.