Differentiable Simulation of Hard Contacts with Soft Gradients for Learning and Control

arXiv cs.RO / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses how discontinuous contact forces limit gradient-based robot learning and control when using simulators with automatic differentiation.
  • It finds that penalty-based approaches (e.g., softening contact resolution) can enable gradients, but “hard contact” regimes cause gradient degradation due to stiff solver settings and incorrect gradients under AD.
  • The authors propose DiffMJX, which improves gradient accuracy by coupling adaptive time integration with penalty-based simulation to better handle hard contacts.
  • They also tackle vanishing contact gradients after separation by introducing Contact from Distance (CFD), using straight-through estimation in the backward pass to keep informative pre-contact gradients while preserving physical realism.

Abstract

Contact forces introduce discontinuities into robot dynamics that severely limit the use of simulators for gradient-based optimization. Penalty-based simulators such as MuJoCo, soften contact resolution to enable gradient computation. However, realistically simulating hard contacts requires stiff solver settings, which leads to incorrect simulator gradients when using automatic differentiation. Contrarily, using non-stiff settings strongly increases the sim-to-real gap. We analyze penalty-based simulators to pinpoint why gradients degrade under hard contacts. Building on these insights, we propose DiffMJX, which couples adaptive time integration with penalty-based simulation to substantially improve gradient accuracy. A second challenge is that contact gradients vanish when bodies separate. To address this, we introduce contacts from distance (CFD) which combines penalty-based simulation with straight-through estimation. By applying CFD exclusively in the backward pass, we obtain informative pre-contact gradients while retaining physical realism.