Boundary Sampling to Learn Predictive Safety Filters via Pontryagin's Maximum Principle

arXiv cs.RO / 4/16/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a data-collection strategy for learning safety filters that targets “boundary” trajectories—those that nearly violate constraints—using Pontryagin’s Maximum Principle (PMP).
  • These boundary trajectories are used to guide training for learned Hamilton-Jacobi Reachability, improving sample efficiency by concentrating data near safety-critical states in high-dimensional systems.
  • The resulting learned Control Barrier Value Function can be used directly as a safety filter for autonomous decision-making.
  • Simulations and experiments on shared-control automotive racing show PMP-based sampling improves convergence speed, reduces failure rates, and improves safe-set reconstruction, with reported wall times around 3ms.

Abstract

Safety filters provide a practical approach for enforcing safety constraints in autonomous systems. While learning-based tools scale to high-dimensional systems, their performance depends on informative data that includes states likely to lead to constraint violation, which can be difficult to efficiently sample in complex, high-dimensional systems. In this work, we characterize trajectories that barely avoid safety violations using the Pontryagin Maximum Principle. These boundary trajectories are used to guide data collection for learned Hamilton-Jacobi Reachability, concentrating learning efforts near safety-critical states to improve efficiency. The learned Control Barrier Value Function is then used directly for safety filtering. Simulations and experimental validation on a shared-control automotive racing application demonstrate PMP sampling improves learning efficiency, yielding faster convergence, reduced failure rates, and improved safe set reconstruction, with wall times around 3ms.