Closed-Loop Verbal Reinforcement Learning for Task-Level Robotic Planning
arXiv cs.RO / 2026/3/24
💬 オピニオンSignals & Early TrendsIdeas & Deep AnalysisModels & Research
要点
- The paper introduces a closed-loop Verbal Reinforcement Learning (VRL) framework for interpretable task-level robotic planning under execution uncertainty.
- It refines executable Behavior Trees by using an LLM actor guided by structured natural-language feedback from a Vision-Language Model critic that analyzes the robot’s observations and execution traces.
- Unlike conventional gradient-based reinforcement learning, VRL updates policies directly at the symbolic planning level without gradient optimization, aiming for transparency and explicit causal feedback.
- The framework is validated on a real mobile robot completing a multi-stage manipulation-and-navigation task, showing explainable policy improvements and adaptation to execution failures.

