Co-Learning Port-Hamiltonian Systems and Optimal Energy-Shaping Control

arXiv cs.AI / 4/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces a physics-informed learning framework to design energy-shaping control for port-Hamiltonian (pH) systems using only trajectory data.
  • It co-learns a pH system model and an optimal energy-balancing passivity-based controller (EB-PBC) via alternating optimization, where each iteration updates the model from policy-driven data and re-optimizes the controller.
  • Both the learned dynamics model and the controller are parameterized with neural networks that respect pH/EB-PBC structure, aiming to preserve interpretability through energy-interaction terms.
  • The resulting closed-loop controller is designed to be inherently passive and provably stable, while dissipation regularization encourages strict energy decay to improve robustness across simulation-to-real differences.
  • Experiments validate the approach on state-regulation and swing-up control tasks for planar and torsional pendulum systems.

Abstract

We develop a physics-informed learning framework for energy-shaping control of port-Hamiltonian (pH) systems from trajectory data. The proposed approach {co-learns} a pH system model and an optimal energy-balancing passivity-based controller (EB-PBC) through alternating optimization with policy-aware data collection. At each iteration, the system model is refined using trajectory data collected under the current control policy, and the controller is re-optimized on the updated model. Both components are parameterized by neural networks that embed the pH {dynamics} and EB-PBC structure, ensuring interpretability in terms of energy {interactions}. The learned controller renders the closed-loop system inherently passive and provably stable, and exploits passive plant dynamics without canceling the natural potential. A dissipation regularization enforces strict energy decay during training, thereby enhancing robustness to sim-to-real gaps. The proposed framework is validated on state-regulation and swing-up tasks for planar and torsional pendulum systems.