TAIL-Safe: Task-Agnostic Safety Monitoring for Imitation Learning Policies

arXiv cs.RO / 5/5/2026


Key Points

  • The paper introduces TAIL-Safe, a task-agnostic safety monitoring method for imitation learning (IL) policies that can fail due to initial-condition sensitivity and irreducible approximation errors.
  • TAIL-Safe learns a Lipschitz-continuous Q-value function that outputs a long-term safety score for state-action pairs using three short-term, task-agnostic criteria: visibility, recognizability, and graspability.
  • The method defines a “safe set” as the zero-superlevel set of the Q-function and triggers intervention when the nominal IL policy proposes actions outside this set.
  • When unsafe actions are proposed, TAIL-Safe applies a recovery mechanism inspired by Nagumo’s theorem, using gradient ascent on the Q-function to steer back toward safety (see the sketch after this list).
  • Using a high-fidelity digital twin built with Gaussian Splatting, the authors collect failure data safely and show on a Franka Emika robot that flow-matching policies can achieve consistent success under runtime perturbations when guided by TAIL-Safe.
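
To make the intervention logic concrete, below is a minimal PyTorch sketch of the safe-set check and gradient-ascent recovery described above. Everything here is illustrative: the `SafetyQ` network, its dimensions, the step size, and the iteration cap are our placeholders, and the plain MLP does not enforce the Lipschitz continuity the paper requires of its learned Q-function.

```python
import torch
import torch.nn as nn

class SafetyQ(nn.Module):
    """Hypothetical stand-in for the learned safety Q-function.
    The paper's actual architecture (and its Lipschitz constraint)
    is not reproduced here."""
    def __init__(self, state_dim: int, action_dim: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden),
            nn.Tanh(),  # smooth activation keeps action gradients well-behaved
            nn.Linear(hidden, 1),
        )

    def forward(self, state: torch.Tensor, action: torch.Tensor) -> torch.Tensor:
        return self.net(torch.cat([state, action], dim=-1)).squeeze(-1)


def monitor_and_recover(q: SafetyQ, state: torch.Tensor, action: torch.Tensor,
                        step_size: float = 0.05, max_iters: int = 20) -> torch.Tensor:
    """If the proposed action leaves the zero-superlevel set {Q >= 0},
    ascend the action gradient of Q until the safety score is restored
    (or the iteration budget runs out)."""
    action = action.clone().detach().requires_grad_(True)
    for _ in range(max_iters):
        q_val = q(state, action)
        if q_val.item() >= 0.0:  # inside the empirical safe set: accept as-is
            break
        q_val.backward()
        with torch.no_grad():
            action += step_size * action.grad  # steer toward higher safety score
        action.grad.zero_()
    return action.detach()


# Toy usage: screen a nominal policy's proposed action for a random state.
q_fn = SafetyQ(state_dim=8, action_dim=4)
state = torch.randn(8)
nominal_action = torch.randn(4)
safe_action = monitor_and_recover(q_fn, state, nominal_action)
```

In practice the recovered action would replace the nominal one only when the check fails, so the monitor adds no overhead on the in-distribution path.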

Abstract

Recent imitation learning (IL) algorithms such as flow-matching and diffusion policies demonstrate remarkable performance in learning complex manipulation tasks. However, these policies often fail even when operating within their training distribution due to extreme sensitivity to initial conditions and irreducible approximation errors that lead to compounding drift. This makes it unsafe to deploy IL policies in the field, where out-of-distribution scenarios are prevalent. A prerequisite for safe deployment is enabling the policy to determine whether it can execute a task the way it was learned from demonstrations. This paper presents TAIL-Safe, a principled approach to identify, for a trained IL policy, a safe set from which the policy empirically succeeds in completing the learned task. We propose a Lipschitz-continuous Q-value function that maps state-action pairs to a long-term safety score based on three short-term task-agnostic criteria: visibility, recognizability, and graspability. The zero-superlevel set of this function characterizes an empirical control invariant set over state-action pairs. When the nominal policy proposes an action outside this set, we apply a recovery mechanism inspired by Nagumo's theorem that uses gradient ascent on the Q-function to steer the policy back to safety. To learn this Q-function, we construct a high-fidelity digital twin using Gaussian Splatting that enables systematic collection of failure data without risk to physical hardware. Experiments with a Franka Emika robot demonstrate that flow-matching policies, which fail under run-time perturbations, achieve consistent task success when guided by the proposed TAIL-Safe.
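
In symbols, the safe set and recovery rule described in the abstract can be written as follows; the notation ($Q_\theta$, $\eta$) is our own shorthand and may differ from the paper's:

```latex
% Safe set: the zero-superlevel set of the learned safety Q-function
\mathcal{S} = \{ (s, a) \mid Q_\theta(s, a) \ge 0 \}

% Recovery: when the nominal policy proposes (s, a) \notin \mathcal{S},
% ascend the action gradient of Q with step size \eta > 0
a \leftarrow a + \eta \, \nabla_a Q_\theta(s, a)
```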