TrafficClaw: Generalizable Urban Traffic Control via Unified Physical Environment Modeling
arXiv cs.AI / 4/21/2026
📰 NewsDeveloper Stack & InfrastructureModels & Research
Key Points
- The paper argues that urban traffic control needs system-level generalization by using a unified physical environment that couples heterogeneous subsystems (signals, freeways, transit, taxis) instead of treating them as isolated tasks.
- It proposes TrafficClaw, which integrates multiple traffic subsystems into a shared dynamical runtime to model cross-subsystem interactions and enable closed-loop feedback between agents and the environment.
- TrafficClaw builds an LLM-based agent with executable spatiotemporal reasoning and reusable procedural memory to perform unified diagnostics across subsystems and iteratively refine strategies.
- The approach includes a multi-stage training pipeline combining supervised initialization with agentic reinforcement learning plus system-level optimization, aiming for coordinated and system-aware performance.
- Experiments reportedly show robust, transferable, and system-aware results on previously unseen traffic scenarios, dynamics, and task configurations, and the project is released on GitHub.



