Which Face and Whose Identity? Solving the Dual Challenge of Deepfake Proactive Forensics in Multi-Face Scenarios

arXiv cs.CV / 4/30/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper highlights that existing proactive deepfake forensics often assume a single-face setting, limiting their ability to localize forgeries and trace sources in real multi-person scenarios like group photos and meetings.
  • It introduces the Deep Attributable Watermarking Framework (DAWF), a multi-face encoder-decoder approach designed to embed watermarks efficiently in-network without traditional offline pre-processing.
  • DAWF uses a selective regional supervision loss to ensure the decoder concentrates only on facial regions that deepfakes have tampered with, improving localization quality.
  • By combining the targeted regional supervision with embedded identity payloads, DAWF aims to answer both “which region was forged” and “who was forged,” i.e., dual “which + who” attribution.
  • Experiments on challenging multi-face datasets reportedly show strong results for deepfake localization and traceability in complex multi-person scenes.

Abstract

Unlike single-face forgeries, deepfakes in complex multi-person interaction scenarios (such as group photos and multi-person meetings) more closely reflect real-world threats. Although existing proactive forensics solutions demonstrate good performance, they heavily rely on a "single-face" setting, making it difficult to effectively address the problems of deepfake localization and source tracing in complex multi-person environments. To address this challenge, we propose the Deep Attributable Watermarking Framework (DAWF). This framework adopts a novel multi-face encoder-decoder architecture that bypasses the cumbersome offline pre-processing steps of traditional forensics, facilitating efficient in-network parallel watermark embedding and cross-face collaborative processing. Crucially, we propose a selective regional supervision loss. This innovative mechanism guides the decoder to focus exclusively on the facial regions tampered with by deepfakes. Leveraging this mechanism alongside the embedded identity payloads, DAWF realizes the "which + who" goal, answering the dual questions of which facial region was forged and who was forged. Extensive experiments on challenging multi-face datasets show that DAWF achieves excellent deepfake localization and traceability in complex multi-person scenes.