MetaTune: Adjoint-based Meta-tuning via Robotic Differentiable Dynamics

arXiv cs.RO / 3/31/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • MetaTune is a unified framework that jointly auto-tunes feedback controllers and disturbance observers using differentiable closed-loop meta-learning for robotic systems under uncertainty.
  • The method employs a portable neural policy together with physics-informed gradients from differentiable system dynamics, enabling adaptive controller gains across tasks and operating conditions.
  • An adjoint-based technique computes meta-gradients backward in time to directly minimize the cost-to-go, reducing computational complexity from forward-horizon methods to linear in the data horizon.
  • Experiments on quadrotor control report consistent gains over existing differentiable tuning approaches, with gradient computation time reduced by more than 50%.
  • In PX4-Gazebo hardware-in-the-loop simulation, the learned adaptive policy improves tracking error by 15–20% at aggressive speeds, up to 40% under strong disturbances, and achieves zero-shot sim-to-sim transfer without fine-tuning.

Abstract

Disturbance observer-based control has shown promise in robustifying robotic systems against uncertainties. However, tuning such systems remains challenging due to the strong coupling between controller gains and observer parameters. In this work, we propose MetaTune, a unified framework for joint auto-tuning of feedback controllers and disturbance observers through differentiable closed-loop meta-learning. MetaTune integrates a portable neural policy with physics-informed gradients derived from differentiable system dynamics, enabling adaptive gain across tasks and operating conditions. We develop an adjoint method that efficiently computes the meta-gradients with respect to adaptive gains backward in time to directly minimize the cost-to-go. Compared to existing forward methods, our approach reduces the computational complexity to be linear in the data horizon. Experimental results on quadrotor control show that MetaTune achieves consistent improvements over state-of-the-art differentiable tuning methods while reducing gradient computation time by more than 50 percent. In high-fidelity PX4-Gazebo hardware-in-the-loop simulation, the learned adaptive policy yields 15-20 percent average tracking error reduction at aggressive flight speeds and up to 40 percent improvement under strong disturbances, while demonstrating zero-shot sim-to-sim transfer without fine-tuning.