RAD-LAD: Rule and Language Grounded Autonomous Driving in Real-Time

arXiv cs.RO / 3/31/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces LAD, a real-time language-action planner with an interruptible architecture that outputs motion plans in a single forward pass at about 20 Hz, or provides textual reasoning alongside motion planning at about 10 Hz.
  • LAD reportedly reduces latency by roughly 3x compared with prior driving language models and achieves state-of-the-art results on nuPlan Test14-Hard and InterPlan using learning-based methods.
  • The authors also propose RAD, a rule-based planner intended to overcome structural limitations of PDM-Closed, achieving leading performance among rule-based planners on the same nuPlan benchmarks.
  • By combining RAD and LAD, the work demonstrates hybrid planning that leverages rules for reliable maneuvering and language for adaptive, explainable decision-making.
  • Overall, the contribution positions interruptible language-driven planning and rule-based safety/structure as complementary components for autonomous driving in closed-loop settings.

Abstract

We present LAD, a real-time language--action planner with an interruptible architecture that produces a motion plan in a single forward pass (~20 Hz) or generates textual reasoning alongside a motion plan (~10 Hz). LAD is fast enough for real-time closed-loop deployment, achieving ~3x lower latency than prior driving language models while setting a new learning-based state of the art on nuPlan Test14-Hard and InterPlan. We also introduce RAD, a rule-based planner designed to address structural limitations of PDM-Closed. RAD achieves state-of-the-art performance among rule-based planners on nuPlan Test14-Hard and InterPlan. Finally, we show that combining RAD and LAD enables hybrid planning that captures the strengths of both approaches. This hybrid system demonstrates that rules and learning provide complementary capabilities: rules support reliable maneuvering, while language enables adaptive and explainable decision-making.