Bridging Large-Model Reasoning and Real-Time Control via Agentic Fast-Slow Planning

arXiv cs.RO / 4/3/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • 提案手法「Agentic Fast-Slow Planning」は、LLMの推論と実時間制御を、時系列の自然なスケールに沿って分離・階層化するフレームワークとして提示されました。
  • Perception2Decisionでは、オンボードのVision-Language Model(VLM)でシーンをエゴ中心のトポロジに圧縮し、クラウド側のLLMで象徴的な運転ディレクティブへ変換して、帯域と遅延を抑えつつ解釈可能性を維持します。
  • Decision2Trajectoryでは、言語由来のソフトコストをSemantic-Guided A*に埋め込んで実行可能な経路を探索し、さらにエージェント的な改良モジュールでハイパーパラメータをフィードバックとメモリで適応させます。
  • 実験(CARLA)では、外乱下で頑健性が向上し、純粋なMPCやA*-誘導型MPC比で車線(横方向)偏差が最大45%減、完了時間が12%以上改善されたと報告されています。
  • コードが公開されており、言語推論と古典的計画・MPC制御を「ブリッジ」で接続する実装可能性が示されています。

Abstract

Large foundation models enable powerful reasoning for autonomous systems, but mapping semantic intent to reliable real-time control remains challenging. Existing approaches either (i) let Large Language Models (LLMs) generate trajectories directly - brittle, hard to verify, and latency-prone - or (ii) adjust Model Predictive Control (MPC) objectives online - mixing slow deliberation with fast control and blurring interfaces. We propose Agentic Fast-Slow Planning, a hierarchical framework that decouples perception, reasoning, planning, and control across natural timescales. The framework contains two bridges. Perception2Decision compresses scenes into ego-centric topologies using an on-vehicle Vision-Language Model (VLM) detector, then maps them to symbolic driving directives in the cloud with an LLM decision maker - reducing bandwidth and delay while preserving interpretability. Decision2Trajectory converts directives into executable paths: Semantic-Guided A* embeds language-derived soft costs into classical search to bias solutions toward feasible trajectories, while an Agentic Refinement Module adapts planner hyperparameters using feedback and memory. Finally, MPC tracks the trajectories in real time, with optional cloud-guided references for difficult cases. Experiments in CARLA show that Agentic Fast-Slow Planning improves robustness under perturbations, reducing lateral deviation by up to 45% and completion time by over 12% compared to pure MPC and an A*-guided MPC baseline. Code is available at https://github.com/cjychenjiayi/icra2026_AFSP.