Uncertainty-Aware Vision-based Risk Object Identification via Conformal Risk Tube Prediction

arXiv cs.CV / 3/26/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses Vision-ROI for intelligent driving hazard detection, arguing that existing methods’ deterministic thresholds can fail in ambiguous, multi-risk scenarios by causing premature/delayed detection and temporally unstable predictions.
  • It proposes Conformal Risk Tube Prediction (CRTP), a spatiotemporal uncertainty-aware framework that outputs calibrated risk scores with uncertainty estimates and provides coverage guarantees for true risks.
  • To enable broader evaluation, the authors introduce a new dataset and metrics designed to test diverse scenario configurations and multi-risk coupling effects lacking in prior datasets.
  • The study analyzes how uncertainty estimation depends on scenario variation, per-risk category behavior, and perception error propagation, and reports substantial improvements over prior approaches, including reduced nuisance braking alerts.

Abstract

We study object importance-based vision risk object identification (Vision-ROI), a key capability for hazard detection in intelligent driving systems. Existing approaches make deterministic decisions and ignore uncertainty, which could lead to safety-critical failures. Specifically, in ambiguous scenarios, fixed decision thresholds may cause premature or delayed risk detection and temporally unstable predictions, especially in complex scenes with multiple interacting risks. Despite these challenges, current methods lack a principled framework to model risk uncertainty jointly across space and time. We propose Conformal Risk Tube Prediction, a unified formulation that captures spatiotemporal risk uncertainty, provides coverage guarantees for true risks, and produces calibrated risk scores with uncertainty estimates. To conduct a systematic evaluation, we present a new dataset and metrics probing diverse scenario configurations with multi-risk coupling effects, which are not supported by existing datasets. We systematically analyze factors affecting uncertainty estimation, including scenario variations, per-risk category behavior, and perception error propagation. Our method delivers substantial improvements over prior approaches, enhancing vision-ROI robustness and downstream performance, such as reducing nuisance braking alerts. For more qualitative results, please visit our project webpage: https://hcis-lab.github.io/CRTP/