SwarmDrive: Semantic V2V Coordination for Latency-Constrained Cooperative Autonomous Driving

arXiv cs.RO / 4/28/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsModels & Research

Key Points

  • The paper proposes SwarmDrive, a semantic V2V coordination framework that uses local small language models (SLMs) on nearby vehicles to avoid cloud inference round-trip delays and connectivity dependence.
  • Vehicles share compact intent distributions only when uncertainty is high, and a lightweight event-triggered consensus is used to fuse shared information.
  • In a 5-seed executable study focused on a single occluded intersection scenario, SwarmDrive with a “Swarm 6G” communication setting improves success rate from 68.9% to 94.1% and cuts latency from a 510 ms cloud reference to 151.4 ms.
  • The cooperative gains depend on swarm size and communication quality: more participating vehicles increase overhead and packet loss, and ablation sweeps suggest an effective balance near 4 vehicles with an entropy trigger threshold of 0.65 in the current prototype.
  • The authors emphasize that the results demonstrate feasibility for latency-constrained semantic edge cooperation in the targeted test case, but they are not yet a deployment-grade validation of a real 6G system.

Abstract

Cloud-hosted LLM inference for autonomous driving adds round-trip delay and depends on stable connectivity, while purely local edge models struggle under occlusion. We present SwarmDrive, a semantic Vehicle-to-Vehicle (V2V) coordination framework in which nearby vehicles run local Small Language Models (SLMs), share compact intent distributions only when uncertainty is high, and fuse them through event-triggered consensus. We evaluate SwarmDrive in a 5-seed executable study built around one occluded intersection case, combining matched operating-point comparisons with robustness sweeps. In that setting, SwarmDrive under its 6G communication setting ("Swarm 6G") raises success from 68.9% to 94.1% over a single local SLM while reducing latency from a 510 ms cloud reference to 151.4 ms. However, an increased number of participating vehicles leads to higher communication overhead and packet loss. SwarmDrive also evaluates the impact of swarm-size, packet-loss, and entropy-threshold sweeps and shows that the cooperative gain holds across ablations and is best balanced near an active swarm size of 4 vehicles and an entropy trigger threshold of 0.65 in the current prototype. These results show that semantic edge cooperation can work under tight latency constraints in the targeted intersection case, but they are not a deployment-grade validation of a real 6G stack.