Only Whats Necessary: Pareto Optimal Data Minimization for Privacy Preserving Video Anomaly Detection

arXiv cs.CV / 3/30/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes “Only What’s Necessary,” a privacy-by-design framework for video anomaly detection that enforces GDPR-style data minimization by limiting what visual information is exposed to the detection pipeline.
  • It uses both breadth-based and depth-based minimization to suppress PII (e.g., facial cues and sensitive demographic attributes) while retaining anomaly-relevant cues.
  • The authors evaluate multiple minimization configurations by testing minimized videos on both a video anomaly detection model and a separate privacy inference model to measure privacy-utility tradeoffs.
  • Pareto analysis plus ranking-based methods are used to identify non-dominated “sweet spot” operating points that reduce personal data exposure with limited loss in detection performance.
  • Extensive experiments on public datasets show the framework’s effectiveness, demonstrating how to tune minimization settings to balance compliance and detection accuracy.

Abstract

Video anomaly detection (VAD) systems are increasingly deployed in safety critical environments and require a large amount of data for accurate detection. However, such data may contain personally identifiable information (PII), including facial cues and sensitive demographic attributes, creating compliance challenges under the EU General Data Protection Regulation (GDPR). In particular, GDPR requires that personal data be limited to what is strictly necessary for a specified processing purpose. To address this, we introduce Only What's Necessary, a privacy-by-design framework for VAD that explicitly controls the amount and type of visual information exposed to the detection pipeline. The framework combines breadth based and depth based data minimization mechanisms to suppress PII while preserving cues relevant to anomaly detection. We evaluate a range of minimization configurations by feeding the minimized videos to both a VAD model and a privacy inference model. We employ two ranking based methods, along with Pareto analysis, to characterize the resulting trade off between privacy and utility. From the non-dominated frontier, we identify sweet spot operating points that minimize personal data exposure with limited degradation in detection performance. Extensive experiments on publicly available datasets demonstrate the effectiveness of the proposed framework.