SparseDriveV2: Scoring is All You Need for End-to-End Autonomous Driving

arXiv cs.CV / 4/1/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that end-to-end autonomous driving can be improved primarily by scaling trajectory scoring, questioning whether dynamic proposal generation is fundamentally required.
  • A systematic scaling study on Hydra-MDP shows that planning performance increases with denser trajectory anchors and does not saturate early before computational limits.
  • SparseDriveV2 proposes two main upgrades to scoring-based planning: a factorized, combinatorially scalable trajectory vocabulary (geometric path + velocity profile) and a two-stage scoring strategy (coarse factorized scoring followed by fine scoring on a small composed set).
  • Using a lightweight ResNet-34 backbone, SparseDriveV2 reports strong benchmark results on NAVSIM (92.0 PDMS / 90.1 EPDMS) and Bench2Drive (89.15 Driving Score / 70.00 Success Rate).
  • The authors release code and models publicly via GitHub to support reproducibility and further research.

Abstract

End-to-end multi-modal planning has been widely adopted to model the uncertainty of driving behavior, typically by scoring candidate trajectories and selecting the optimal one. Existing approaches generally fall into two categories: scoring a large static trajectory vocabulary, or scoring a small set of dynamically generated proposals. While static vocabularies often suffer from coarse discretization of the action space, dynamic proposals provide finer-grained precision and have shown stronger empirical performance on existing benchmarks. However, it remains unclear whether dynamic generation is fundamentally necessary, or whether static vocabularies can already achieve comparable performance when they are sufficiently dense to cover the action space. In this work, we start with a systematic scaling study of Hydra-MDP, a representative scoring-based method, revealing that performance consistently improves as trajectory anchors become denser, without exhibiting saturation before computational constraints are reached. Motivated by this observation, we propose SparseDriveV2 to push the performance boundary of scoring-based planning through two complementary innovations: (1) a scalable vocabulary representation with a factorized structure that decomposes trajectories into geometric paths and velocity profiles, enabling combinatorial coverage of the action space, and (2) a scalable scoring strategy with coarse factorized scoring over paths and velocity profiles followed by fine-grained scoring on a small set of composed trajectories. By combining these two techniques, SparseDriveV2 achieves 92.0 PDMS and 90.1 EPDMS on NAVSIM, with 89.15 Driving Score and 70.00 Success Rate on Bench2Drive with a lightweight ResNet-34 as backbone. Code and model are released at https://github.com/swc-17/SparseDriveV2.