DAG-STL: A Hierarchical Framework for Zero-Shot Trajectory Planning under Signal Temporal Logic Specifications

arXiv cs.RO / 4/21/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper introduces DAG-STL, a hierarchical framework for zero-shot trajectory planning under Signal Temporal Logic (STL) specifications when system dynamics and environment structure are unknown.
  • DAG-STL separates logical reasoning from trajectory realization by decomposing long-horizon STL goals into reachability/invariance progress conditions, then allocating timed waypoints using learned reachability-time estimates.
  • It synthesizes trajectories between allocated waypoints with a diffusion-based generator, turning global planning into shorter subproblems that are easier to solve.
  • To connect planning correctness with real-world feasibility, the work proposes a rollout-free dynamic consistency metric and an anytime refinement search to improve multiple waypoint-allocation hypotheses within limited compute budgets, along with hierarchical online replanning for execution-time recovery.
  • Experiments on Maze2D, OGBench AntMaze, and Cube (plus a custom reference-based environment) show that DAG-STL substantially improves over direct robustness-guided diffusion and generalizes across navigation and manipulation tasks while keeping computational advantages over explicit-model optimization.

Abstract

Signal Temporal Logic (STL) is a powerful language for specifying temporally structured robotic tasks. Planning executable trajectories under STL constraints remains difficult when system dynamics and environment structure are not analytically available. Existing methods typically either assume explicit models or learn task-specific behaviors, limiting zero-shot generalization to unseen STL tasks. In this work, we study offline STL planning under unknown dynamics using only task-agnostic trajectory data. Our central design philosophy is to separate logical reasoning from trajectory realization. We instantiate this idea in DAG-STL, a hierarchical framework that converts long-horizon STL planning into three stages. It first decomposes an STL formula into reachability and invariance progress conditions linked by shared timing constraints. It then allocates timed waypoints using learned reachability-time estimates. Finally, it synthesizes trajectories between these waypoints with a diffusion-based generator. This decomposition--allocation--generation pipeline reduces global planning to shorter, better-supported subproblems. To bridge the gap between planning-level correctness and execution-level feasibility, we further introduce a rollout-free dynamic consistency metric, an anytime refinement search procedure for improving multiple allocation hypotheses under finite budgets, and a hierarchical online replanning mechanism for execution-time recovery. Experiments in Maze2D, OGBench AntMaze, and the Cube domain show that DAG-STL substantially outperforms direct robustness-guided diffusion on complex long-horizon STL tasks and generalizes across navigation and manipulation settings. In a custom environment with an optimization-based reference, DAG-STL recovers most model-solvable tasks while retaining a clear computational advantage over direct optimization based on the explicit system model.