Curvelet-Based Frequency-Aware Feature Enhancement for Deepfake Detection

arXiv cs.AI / 4/15/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a deepfake detector that leverages the Curvelet Transform to extract frequency-domain features with strong directional and multiscale properties, addressing robustness issues seen when detectors rely on spatial-domain cues.
  • It introduces wedge-level attention and scale-aware spatial masking to selectively emphasize discriminative frequency components, then reconstructs refined frequency cues for classification.
  • The reconstructed features are fed into a modified pretrained Xception network, combining frequency enhancement with established CNN classification.
  • On FaceForensics++ under two compression qualities, the approach reports 98.48% accuracy and 99.96% AUC on low compression while also retaining strong performance on high compression.
  • The authors highlight both effectiveness and interpretability of the Curvelet-informed forgery detection pipeline, positioning it as a new frequency representation for deepfake robustness.

Abstract

The proliferation of sophisticated generative models has significantly advanced the realism of synthetic facial content, known as deepfakes, raising serious concerns about digital trust. Although modern deep learning-based detectors perform well, many rely on spatial-domain features that degrade under compression. This limitation has prompted a shift toward integrating frequency-domain representations with deep learning to improve robustness. Prior research has explored frequency transforms such as Discrete Cosine Transform (DCT), Fast Fourier Transform (FFT), and Wavelet Transform, among others. However, to the best of our knowledge, the Curvelet Transform, despite its superior directional and multiscale properties, remains entirely unexplored in the context of deepfake detection. In this work, we introduce a novel Curvelet-based detection approach that enhances feature quality through wedge-level attention and scale-aware spatial masking, both trained to selectively emphasize discriminative frequency components. The refined frequency cues are reconstructed and passed to a modified pretrained Xception network for classification. Evaluated on two compression qualities in the challenging FaceForensics++ dataset, our method achieves 98.48% accuracy and 99.96% AUC on FF++ low compression, while maintaining strong performance under high compression, demonstrating the efficacy and interpretability of Curvelet-informed forgery detection.