CIPHER: Counterfeit Image Pattern High-level Examination via Representation

arXiv cs.CV / 4/1/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

共有:

Key Points

CIPHER is a proposed deepfake detection framework designed to stay robust across multiple, rapidly evolving generative models like GANs and diffusion systems.
The method reuses and fine-tunes discriminators from existing image-generation models, extracting generation-agnostic artifacts using scale-adaptive and temporal-consistency features.
Experiments across nine state-of-the-art generative models show strong cross-model performance, reaching up to 74.33% F1-score and improving over ViT-based detectors by more than 30% on average.
CIPHER is reported to remain effective on difficult benchmark datasets where baseline detectors fail, achieving up to 88% F1-score on CIFAKE compared to near-zero for conventional approaches.
The authors argue that discriminator reuse and cross-model fine-tuning can provide a more generalizable and future-proof direction for deepfake detection as generative quality increases.

Abstract

The rapid progress of generative adversarial networks (GANs) and diffusion models has enabled the creation of synthetic faces that are increasingly difficult to distinguish from real images. This progress, however, has also amplified the risks of misinformation, fraud, and identity abuse, underscoring the urgent need for detectors that remain robust across diverse generative models. In this work, we introduce Counterfeit Image Pattern High-level Examination via Representation(CIPHER), a deepfake detection framework that systematically reuses and fine-tunes discriminators originally trained for image generation. By extracting scale-adaptive features from ProGAN discriminators and temporal-consistency features from diffusion models, CIPHER captures generation-agnostic artifacts that conventional detectors often overlook. Through extensive experiments across nine state-of-the-art generative models, CIPHER demonstrates superior cross-model detection performance, achieving up to 74.33% F1-score and outperforming existing ViT-based detectors by over 30% in F1-score on average. Notably, our approach maintains robust performance on challenging datasets where baseline methods fail, with up to 88% F1-score on CIFAKE compared to near-zero performance from conventional detectors. These results validate the effectiveness of discriminator reuse and cross-model fine-tuning, establishing CIPHER as a promising approach toward building more generalizable and robust deepfake detection systems in an era of rapidly evolving generative technologies.