Online Safety Filter for Deformable Object Manipulation with Horizon Agnostic Neural Operators

arXiv cs.RO / 5/5/2026

📰 NewsDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses a key gap in robotic manipulation with deformable media by proposing safety enforcement that directly satisfies constraints rather than relying on reward shaping proxies.
  • It introduces a constraint-driven online safety filter that minimally modifies any nominal control policy in real time to enforce explicit task-level safety constraints.
  • The method uses a horizon-agnostic neural operator to learn the boundary input-output mapping of underlying PDE dynamics and to generalize across different rollout lengths without retraining.
  • Safety is certified at task-relevant outputs using a boundary control barrier function solved via a lightweight quadratic program for real-time operation.
  • Experiments on FluidLab fluid manipulation show up to a 22% improvement in safe trajectory rates versus unfiltered policies and fewer steps to reach the safe set, indicating both reliability and efficiency.

Abstract

Safety critical control of robotic manipulation tasks involving deformable media such as fluids, cloth, and soft objects remains challenging because existing learning based approaches encode safety indirectly through reward shaping, which provides no guarantee of constraint satisfaction at deployment. We present a constraint driven online safety filter for deformable object manipulation that enforces explicit task level safety constraints in real time by minimally modifying any nominal control policy. Our approach combines two key components: a horizon agnostic neural operator that learns the boundary input output mapping of the underlying PDE dynamics and generalizes across variable rollout lengths without retraining, and a boundary control barrier function that certifies safety at the task relevant output level via a lightweight quadratic program. The resulting safety constraint is affine in the boundary input rate, enabling real time online filtering. We evaluate the proposed method on fluid manipulation tasks in FluidLab, where the filter improves safe trajectory rates by up to 22% over unfiltered base policies while also reducing the number of steps required to reach the safe set, demonstrating that constraint driven safety enforcement is both more reliable and more efficient than reward shaping approaches.