Towards Video Anomaly Detection from Event Streams: A Baseline and Benchmark Datasets

arXiv cs.CV / 3/27/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper argues that event-based vision is a promising fit for video anomaly detection due to low redundancy, motion-centric signals, and privacy-preserving properties.
  • It introduces the first set of synchronized event-stream and RGB benchmark datasets for event-stream-based video anomaly detection, aiming to unify the research direction.
  • The proposed EWAD framework includes an event-density-aware dynamic sampling strategy to focus on informative time segments.
  • EWAD also uses density-modulated temporal modeling to extract context from sparse event streams and an RGB-to-event knowledge distillation mechanism to strengthen event representations under weak supervision.
  • Experiments across three benchmarks show significant gains over existing methods, and the datasets are planned to be publicly released.

Abstract

Event-based vision, characterized by low redundancy, focus on dynamic motion, and inherent privacy-preserving properties, naturally fits the demands of video anomaly detection (VAD). However, the absence of dedicated event-stream anomaly detection datasets and effective modeling strategies has significantly hindered progress in this field. In this work, we take the first major step toward establishing event-based VAD as a unified research direction. We first construct multiple event-stream based benchmarks for video anomaly detection, featuring synchronized event and RGB recordings. Leveraging the unique properties of events, we then propose an EVent-centric spatiotemporal Video Anomaly Detection framework, namely EWAD, with three key innovations: an event density aware dynamic sampling strategy to select temporally informative segments; a density-modulated temporal modeling approach that captures contextual relations from sparse event streams; and an RGB-to-event knowledge distillation mechanism to enhance event-based representations under weak supervision. Extensive experiments on three benchmarks demonstrate that our EWAD achieves significant improvements over existing approaches, highlighting the potential and effectiveness of event-driven modeling for video anomaly detection. The benchmark datasets will be made publicly available.
広告