Risk-Controllable Multi-View Diffusion for Driving Scenario Generation
arXiv cs.CV / 3/13/2026
📰 NewsModels & Research
Key Points
- RiskMV-DPO is a general pipeline enabling physically-informed, risk-controllable generation of multi-view driving scenarios by conditioning diffusion-based video synthesis on target risk levels and grounded risk modeling.
- The approach adds a geometry-appearance alignment module and a region-aware direct preference optimization (RA-DPO) with motion-aware masking to ensure spatial-temporal coherence and focus learning on dynamic regions.
- On the nuScenes dataset, RiskMV-DPO generates diverse long-tail scenarios while achieving state-of-the-art visual quality, increasing 3D detection mAP from 18.17 to 30.50 and reducing FID to 15.70.
- This work shifts world models from passive environment prediction to proactive, risk-controllable synthesis, offering a scalable toolchain for safety-oriented embodied intelligence development.
Related Articles
[D] Matryoshka Representation Learning
Reddit r/MachineLearning
Two new Qwen3.5 “Neo” fine‑tunes focused on fast, efficient reasoning
Reddit r/LocalLLaMA

HKIC, Gobi Partners and HKU team up for fund backing university research start-ups
SCMP Tech
Yann LeCun’s New LeWorldModel (LeWM) Research Targets JEPA Collapse in Pixel-Based Predictive World Modeling
MarkTechPost
Streaming experts
Simon Willison's Blog