CuraLight: Debate-Guided Data Curation for LLM-Centered Traffic Signal Control
arXiv cs.AI / 4/8/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces CuraLight, an LLM-centered framework for traffic signal control that uses an RL agent to explore traffic environments and collect high-quality interaction trajectories for training data.
- CuraLight converts the RL-generated trajectories into prompt-response pairs and applies imitation fine-tuning, aiming to improve interpretability and reduce the need for large amounts of interaction data.
- It adds a multi-LLM ensemble “deliberation” mechanism that uses structured debate to evaluate candidate signal timing actions and produce preference-aware supervision signals.
- Experiments in SUMO across heterogeneous networks (Jinan, Hangzhou, Yizhuang) show consistent performance gains over state-of-the-art baselines, including 5.34% lower average travel time, 5.14% shorter average queue length, and 7.02% reduced waiting time.
- The study argues that combining RL-assisted exploration with debate-based data curation can yield scalable and more interpretable LLM-driven traffic signal strategies that generalize better across varied intersections.




