Beyond Surface Artifacts: Capturing Shared Latent Forgery Knowledge Across Modalities
arXiv cs.CV / 4/10/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that multimodal deepfake forensics suffers a generalization bottleneck because existing methods overfit to superficial, modality-specific artifacts rather than shared latent forgery cues.
- It proposes a modality-agnostic forgery (MAF) detection framework that decouples modality-specific styles to extract essential cross-modal latent forgery knowledge.
- The study introduces Weak MAF (transferability to semantically related modalities) and Strong MAF (robustness to isolated “dark modalities”) to measure different levels of generalization.
- To evaluate these limits, it presents the DeepModal-Bench benchmark, aggregating multimodal forgery detection algorithms and incorporating generalized learning approaches.
- The authors report empirical evidence for universal forgery traces and substantial performance gains on unseen modalities using the MAF framework, positioning it as a step toward universal multimodal defense.



