Learning collision risk proactively from naturalistic driving data at scale

arXiv cs.RO / 3/24/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • 研究は、都市の複雑な環境で今後の衝突リスクをドライバーや自動運転に対して早期に警告することを目的に、自然istic drivingデータからラベルなしで衝突リスクを学習する手法「GSSM(Generalised Surrogate Safety Measure)」を提案している。
  • GSSMは複数データセットで学習し、2,591件の実世界のクラッシュ/ヒヤリハット評価で、運動学的特徴のみの基本モデルでも精度再現率曲線のAUPRC=0.9、衝突回避のための中央値で2.6秒の予測先行時間を示した。
  • 追加の対話パターンやコンテキスト要因を取り込むことで性能向上が見られ、リアエンド、合流、右左折などの相互作用シナリオで既存ベースラインより高い精度と即時性を一貫して達成した。
  • 危険なインタラクションを「不可避になる前」に検知できるスケーラブルで汎用的な基盤として、自動運転の安全対策や交通インシデント管理への応用が期待される。

Abstract

Accurately and proactively alerting drivers or automated systems to emerging collisions is crucial for road safety, particularly in highly interactive and complex urban environments. Existing methods either require labour-intensive annotation of sparse risk, struggle to consider varying contextual factors, or are tailored to limited scenarios. Here we present the Generalised Surrogate Safety Measure (GSSM), a data-driven approach that learns collision risk from naturalistic driving without the need for crash or risk labels. Trained over multiple datasets and evaluated on 2,591 real-world crashes and near-crashes, a basic GSSM using only instantaneous motion kinematics achieves an area under the precision-recall curve of 0.9, and secures a median time advance of 2.6 seconds to prevent potential collisions. Incorporating additional interaction patterns and contextual factors provides further performance gains. Across interaction scenarios such as rear-end, merging, and turning, GSSM consistently outperforms existing baselines in accuracy and timeliness. These results establish GSSM as a scalable, context-aware, and generalisable foundation to identify risky interactions before they become unavoidable, supporting proactive safety in autonomous driving systems and traffic incident management. Code and experiment data are openly accessible at https://github.com/Yiru-Jiao/GSSM.