Fusing Driver Perceived and Physical Risk for Safety Critical Scenario Screening in Autonomous Driving

arXiv cs.RO / 2026/3/24

💬 オピニオンIdeas & Deep AnalysisModels & Research

要点

  • The paper proposes a driver-risk fusion approach to efficiently screen safety-critical scenarios in autonomous driving testing using large-scale naturalistic data.
  • It avoids manual frame-by-frame risk annotation and expensive per-frame risk evaluation by generating supervision from a learned Driver Risk Field during training and predicting scenario-level risk scores with fast inference.
  • The method improves risk estimation with a risk height function and a speed-adaptive look-ahead mechanism, and it uses a dynamic cost model combining kinetic energy, oriented bounding box constraints, and Gaussian-kernel diffusion smoothing to better model interactions.
  • A risk trajectory cross-attention decoder jointly decodes risk and trajectories, aiming for more accurate and smoother risk estimates.
  • Experiments on INTERACTION and FLUID show improved performance, including AUC 0.792 and AP 0.825 on FLUID, outperforming PODAR by 9.1% (AUC) and 5.1% (AP).

Abstract

Autonomous driving testing increasingly relies on mining safety critical scenarios from large scale naturalistic driving data, yet existing screening pipelines still depend on manual risk annotation and expensive frame by frame risk evaluation, resulting in low efficiency and weakly grounded risk quantification. To address this issue, we propose a driver risk fusion based hazardous scenario screening method for autonomous driving. During training, the method combines an improved Driver Risk Field with a dynamic cost model to generate high quality risk supervision signals, while during inference it directly predicts scenario level risk scores through fast forward passes, avoiding per frame risk computation and enabling efficient large scale ranking and retrieval. The improved Driver Risk Field introduces a new risk height function and a speed adaptive look ahead mechanism, and the dynamic cost model integrates kinetic energy, oriented bounding box constraints, and Gaussian kernel diffusion smoothing for more accurate interaction modeling. We further design a risk trajectory cross attention decoder to jointly decode risk and trajectories. Experiments on the INTERACTION and FLUID datasets show that the proposed method produces smoother and more discriminative risk estimates. On FLUID, it achieves an AUC of 0.792 and an AP of 0.825, outperforming PODAR by 9.1 percent and 5.1 percent, respectively, demonstrating its effectiveness for scalable risk labeling and hazardous scenario screening.