Unicorn: A Universal and Collaborative Reinforcement Learning Approach Towards Generalizable Network-Wide Traffic Signal Control

arXiv cs.RO / 3/26/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes “Unicorn,” a universal, collaborative multi-agent reinforcement learning framework aimed at improving network-wide adaptive traffic signal control across heterogeneous real-world traffic networks.
  • It introduces a unified mapping method that converts intersection states/actions from different topologies into a common structure using traffic-movement-based representations.
  • A Universal Traffic Representation (UTR) module with a decoder-only network is used to extract general features that transfer better across differing traffic scenarios.
  • The framework adds an Intersection Specifics Representation (ISR) module using variational inference to capture latent vectors that encode each intersection’s unique topology and dynamics, further refined via self-supervised contrastive learning.
  • To enable effective regional collaboration, Unicorn integrates neighboring agents’ state-action dependencies into policy optimization, improving coordination under dynamic interactions, and releases code on GitHub.

Abstract

Adaptive traffic signal control (ATSC) is crucial in reducing congestion, maximizing throughput, and improving mobility in rapidly growing urban areas. Recent advancements in parameter-sharing multi-agent reinforcement learning (MARL) have greatly enhanced the scalable and adaptive optimization of complex, dynamic flows in large-scale homogeneous networks. However, the inherent heterogeneity of real-world traffic networks, with their varied intersection topologies and interaction dynamics, poses substantial challenges to achieving scalable and effective ATSC across different traffic scenarios. To address these challenges, we present Unicorn, a universal and collaborative MARL framework designed for efficient and adaptable network-wide ATSC. Specifically, we first propose a unified approach to map the states and actions of intersections with varying topologies into a common structure based on traffic movements. Next, we design a Universal Traffic Representation (UTR) module with a decoder-only network for general feature extraction, enhancing the model's adaptability to diverse traffic scenarios. Additionally, we incorporate an Intersection Specifics Representation (ISR) module, designed to identify key latent vectors that represent the unique intersection's topology and traffic dynamics through variational inference techniques. To further refine these latent representations, we employ a contrastive learning approach in a self-supervised manner, which enables better differentiation of intersection-specific features. Moreover, we integrate the state-action dependencies of neighboring agents into policy optimization, which effectively captures dynamic agent interactions and facilitates efficient regional collaboration. [...]. The code is available at https://github.com/marmotlab/Unicorn