Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework

arXiv cs.RO / 4/7/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses the challenge of safe and robust autonomous vehicle (AV) platooning in hybrid traffic with human-driven vehicles (HDVs), where formation control and adaptability are difficult under dynamic conditions.
  • It proposes TriCoD, a twin-world safety-enhanced Data-Model-Knowledge cooperative decision-making framework that combines deep reinforcement learning (DRL) with model-driven methods.
  • TriCoD enables dynamic formation dissolution and reconfiguration using a safety-prioritized twin-world deduction mechanism to better handle emergency scenarios.
  • The framework uses an adaptive switching mechanism to switch between data-driven and model-driven strategies in real time, improving flexibility and efficiency.
  • Reported simulation and hardware-in-the-loop experiments indicate significant gains in safety, robustness, and adaptability for AV platooning.

Abstract

In hybrid traffic environments where human-driven vehicles (HDVs) and autonomous vehicles (AVs) coexist, achieving safe and robust decision-making for AV platooning remains a complex challenge. Existing platooning systems often struggle with dynamic formation management and adaptability, especially under complex and dynamic mixed-traffic conditions. To enhance autonomous vehicle platooning within these hybrid environments, this paper presents TriCoD, a twin-world safety-enhanced Data-Model-Knowledge Triple-Driven Cooperative Decision-making Framework. This framework integrates deep reinforcement learning (DRL) with model-driven approaches, enabling dynamic formation dissolution and reconfiguration through a safety-prioritized twin-world deduction mechanism. The DRL component augments traditional model-driven methods, enhancing both safety and operational efficiency, especially under emergency conditions. Additionally, an adaptive switching mechanism allows the system to seamlessly switch between data-driven and model-driven strategies based on real-time traffic demands, thus optimizing decision-making ability and adaptability. Simulation experiments and hardware-in-the-loop tests demonstrate that the proposed framework significantly improves safety, robustness, and flexibility.