AI Navigate

Decoupled Sensitivity-Consistency Learning for Weakly Supervised Video Anomaly Detection

arXiv cs.CV / 3/23/2026

📰 NewsModels & Research

Key Points

  • Weakly supervised video anomaly detection suffers from a sensitivity-stability trade-off, leading to either fragmented detections for transient events or over-smoothed responses for sustained anomalies.
  • The authors propose DeSC, a decoupled framework that trains two specialized streams with distinct optimization strategies: a temporal sensitivity stream and a semantic consistency stream.
  • The temporal sensitivity stream uses aggressive optimization to capture high-frequency abrupt changes, while the semantic consistency stream applies robust constraints to maintain long-term coherence and reduce noise.
  • A collaborative inference mechanism fuses the two streams to reduce individual biases and produce balanced predictions.
  • DeSC achieves new state-of-the-art results on UCF-Crime (89.37% AUC, +1.29%) and XD-Violence (87.18% AP, +2.22%), and code is available on GitHub for reproducibility.

Abstract

Recent weakly supervised video anomaly detection methods have achieved significant advances by employing unified frameworks for joint optimization. However, this paradigm is limited by a fundamental sensitivity-stability trade-off, as the conflicting objectives for detecting transient and sustained anomalies lead to either fragmented predictions or over-smoothed responses. To address this limitation, we propose DeSC, a novel Decoupled Sensitivity-Consistency framework that trains two specialized streams using distinct optimization strategies. The temporal sensitivity stream adopts an aggressive optimization strategy to capture high-frequency abrupt changes, whereas the semantic consistency stream applies robust constraints to maintain long-term coherence and reduce noise. Their complementary strengths are fused through a collaborative inference mechanism that reduces individual biases and produces balanced predictions. Extensive experiments demonstrate that DeSC establishes new state-of-the-art performance by achieving 89.37% AUC on UCF-Crime (+1.29%) and 87.18% AP on XD-Violence (+2.22%). Code is available at https://github.com/imzht/DeSC.