SurFITR: A Dataset for Surveillance Image Forgery Detection and Localisation

arXiv cs.CV / 4/9/2026

📰 NewsSignals & Early TrendsModels & Research

Key Points

  • SurFITR is a newly released public dataset designed to benchmark surveillance-style image forgery detection and localization, targeting the gap between existing forgery datasets and real surveillance conditions.
  • The dataset contains 137k+ tampered surveillance images spanning varied resolutions, viewpoints, occlusions, and subtle, localized manipulation types that reflect how evidence is often falsified.
  • SurFITR’s tampered data were generated using a multimodal LLM-powered pipeline that supports semantically aware, fine-grained edits across diverse scenes.
  • Experiments reported in the announcement show current forgery detectors perform significantly worse on SurFITR, while models trained on SurFITR improve both in-domain and cross-domain performance.
  • The dataset is available on GitHub and includes multiple editing models to diversify edit types and reduce overfitting to a single manipulation strategy.

Abstract

We present the Surveillance Forgery Image Test Range (SurFITR), a dataset for surveillance-style image forgery detection and localisation, in response to recent advances in open-access image generation models that raise concerns about falsifying visual evidence. Existing forgery models, trained on datasets with full-image synthesis or large manipulated regions in object-centric images, struggle to generalise to surveillance scenarios. This is because tampering in surveillance imagery is typically localised and subtle, occurring in scenes with varied viewpoints, small or occluded subjects, and lower visual quality. To address this gap, SurFITR provides a large collection of forensically valuable imagery generated via a multimodal LLM-powered pipeline, enabling semantically aware, fine-grained editing across diverse surveillance scenes. It contains over 137k tampered images with varying resolutions and edit types, generated using multiple image editing models. Extensive experiments show that existing detectors degrade significantly on SurFITR, while training on SurFITR yields substantial improvements in both in-domain and cross-domain performance. SurFITR is publicly available on GitHub.