CounterScene: Counterfactual Causal Reasoning in Generative World Models for Safety-Critical Closed-Loop Evaluation

arXiv cs.RO / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper introduces CounterScene, a framework for generating safety-critical driving scenarios by using structured counterfactual causal reasoning rather than heuristic adversarial perturbations.
CounterScene identifies the causally critical agent and conflict types, then uses a causal interaction graph inside a conflict-aware interactive BEV world model to model inter-agent dependencies in closed loop.
It generates counterfactuals via stage-adaptive minimal interventions that remove the agent’s safety margins while letting risk emerge through natural propagation, aiming to balance adversarial strength and realism.
Experiments on nuScenes show improved long-horizon performance, including higher collision-risk emergence (12.3% to 22.7% over the strongest baseline) alongside better trajectory realism (ADE 1.88 vs. 2.09).
The method also demonstrates stronger advantages over longer rollouts and generalizes zero-shot to nuPlan with state-of-the-art realism.

Abstract

Generating safety-critical driving scenarios requires understanding why dangerous interactions arise, rather than merely forcing collisions. However, existing methods rely on heuristic adversarial agent selection and unstructured perturbations, lacking explicit modeling of interaction dependencies and thus exhibiting a realism--adversarial trade-off. We present CounterScene, a framework that endows closed-loop generative BEV world models with structured counterfactual reasoning for safety-critical scenario generation. Given a safe scene, CounterScene asks: what if the causally critical agent had behaved differently? To answer this, we introduce causal adversarial agent identification to identify the critical agent and classify conflict types, and develop a conflict-aware interactive world model in which a causal interaction graph is used to explicitly model dynamic inter-agent dependencies. Building on this structure, stage-adaptive counterfactual guidance performs minimal interventions on the identified agent, removing its spatial and temporal safety margins while allowing risk to emerge through natural interaction propagation. Extensive experiments on nuScenes demonstrate that CounterScene achieves the strongest adversarial effectiveness while maintaining superior trajectory realism across all horizons, improving long-horizon collision rate from 12.3% to 22.7% over the strongest baseline with better realism (ADE 1.88 vs.2.09). Notably, this advantage further widens over longer rollouts, and CounterScene generalizes zero-shot to nuPlan with state-of-the-art realism.