GIFGuard: Proactive Forensics against Deepfakes in Facial GIFs via Spatiotemporal Watermarking

arXiv cs.CV / 4/30/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces GIFGuard, a first-of-its-kind spatiotemporal watermarking framework designed specifically for proactive forensics against deepfakes in animated GIF facial imagery.
  • It proposes STARE, a 3D-convolution-based encoder with adaptive channel recalibration to embed watermarks robustly even when higher-level semantic content is tampered with.
  • For watermark retrieval, it presents DIRD, a spatiotemporal hourglass decoder with 3D attention that restores latent features to extract watermark signals accurately under strong facial manipulation.
  • The authors also release GIFfaces, a new large-scale benchmark dataset for GIF proactive forensics, and report strong visual quality alongside robustness against deepfake attacks.
  • They state that related code and the dataset will be released to enable further research and validation in this area.

Abstract

The rapid evolution of deepfake technology poses an unprecedented threat to the authenticity of Graphics Interchange Format (GIF) imagery, which serves as a representative of short-loop temporal media in social networks. However, existing proactive forensics works are designed for static images, which limits their applicability to animated GIFs. To bridge this gap, we propose GIFGuard, the first spatiotemporal watermarking framework tailored for deepfake proactive forensics in GIFs. In the embedding stage, we propose the Spatiotemporal Adaptive Residual Encoder (STARE) to ensure robustness against high-level semantic tampering. It employs a 3D convolutional backbone with adaptive channel recalibration to capture globally coherent temporal dependencies. In the extraction stage, we design the Deep Integrity Restoration Decoder (DIRD). It utilizes a spatiotemporal hourglass architecture equipped with 3D attention to restore latent features, allowing for the accurate extraction of watermark signals even under severe facial manipulation. Furthermore, we construct GIFfaces, the first large-scale benchmark dataset curated for GIF proactive forensics to facilitate research in this domain. Extensive results show that GIFGuard achieves high-fidelity visual quality and remarkable robustness performance against deepfakes. Related code and dataset will be released.