Safety Guarantees in Zero-Shot Reinforcement Learning for Cascade Dynamical Systems

arXiv cs.AI / 4/14/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how to obtain zero-shot safety guarantees for cascade dynamical systems, where inner states influence outer states but not vice versa.
  • It defines safety as staying within a high-probability “safe set” for all time, and proposes training a safe RL policy on a reduced-order model that ignores inner-state dynamics while modeling their effect via actions.
  • For deployment in the full system, the RL policy is paired with a low-level controller that tracks the RL-provided reference, separating high-level decision-making from real-time stabilization.
  • The main theoretical contribution is a probabilistic bound on remaining safe after zero-shot deployment in the full-order system, explicitly linking safety to both the inner-state tracking quality and the deployment-time behavior.
  • Experiments on a quadrotor navigation task show that preserving safety guarantees depends on the low-level controller’s bandwidth and tracking performance.

Abstract

This paper considers the problem of zero-shot safety guarantees for cascade dynamical systems. These are systems where a subset of the states (the inner states) affects the dynamics of the remaining states (the outer states) but not vice-versa. We define safety as remaining on a set deemed safe for all times with high probability. We propose to train a safe RL policy on a reduced-order model, which ignores the dynamics of the inner states, but it treats it as an action that influences the outer state. Thus, reducing the complexity of the training. When deployed in the full system the trained policy is combined with a low-level controller whose task is to track the reference provided by the RL policy. Our main theoretical contribution is a bound on the safe probability in the full-order system. In particular, we establish the interplay between the probability of remaining safe after the zero-shot deployment and the quality of the tracking of the inner states. We validate our theoretical findings on a quadrotor navigation task, demonstrating that the preservation of the safety guarantees is tied to the bandwidth and tracking capabilities of the low-level controller.