Can Users Specify Driving Speed? Bench2Drive-Speed: Benchmark and Baselines for Desired-Speed Conditioned Autonomous Driving

arXiv cs.RO / 3/27/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Bench2Drive-Speed, a new benchmark (dataset, metrics, and baselines) for desired-speed conditioned end-to-end autonomous driving, including explicit user inputs for target speed and follow/overtake instructions.
  • It defines quantitative evaluation metrics—Speed-Adherence Score and Overtake Score—to measure how faithfully a driving policy follows user specifications while staying compatible with standard autonomous-driving metrics.
  • To train speed-conditioned policies without costly expert demonstrations, the authors propose re-annotating existing regular driving data by using speed observed in future frames as the target speed for training.
  • Experiments using the CustomizedSpeedDataset (2,100 annotated clips) show that, with proper re-annotation, models trained from regular driving data can match performance of approaches relying on expert demonstration supervision.
  • The study finds that target-speed following can be achieved without harming standard driving performance, but overtaking commands remain substantially more difficult due to the interactive nature of such maneuvers.

Abstract

End-to-end autonomous driving (E2E-AD) has achieved remarkable progress. However, one practical and useful function has been long overlooked: users may wish to customize the desired speed of the policy or specify whether to allow the autonomous vehicle to overtake. To bridge this gap, we present Bench2Drive-Speed, a benchmark with metrics, dataset, and baselines for desired-speed conditioned autonomous driving. We introduce explicit inputs of users' desired target-speed and overtake/follow instructions to driving policy models. We design quantitative metrics, including Speed-Adherence Score and Overtake Score, to measure how faithfully policies follow user specifications, while remaining compatible with standard autonomous driving metrics. To enable training of speed-conditioned policies, one approach is to collect expert demonstrations that strictly follow speed requirements, an expensive and unscalable process in the real world. An alternative is to adapt existing regular driving data by treating the speed observed in future frames as the target speed for training. To investigate this, we construct CustomizedSpeedDataset, composed of 2,100 clips annotated with experts demonstrations, enabling systematic investigation of supervision strategies. Our experiments show that, under proper re-annotation, models trained on regular driving data perform comparably to on expert demonstrations, suggesting that speed supervision can be introduced without additional complex real-world data collection. Furthermore, we find that while target-speed following can be achieved without degrading regular driving performance, executing overtaking commands remains challenging due to the inherent difficulty of interactive behaviors. All code, datasets and baselines are available at https://github.com/Thinklab-SJTU/Bench2Drive-Speed
広告