ID-Eraser: Proactive Defense Against Face Swapping via Identity Perturbation

arXiv cs.CV / 4/24/2026

📰 NewsDeveloper Stack & InfrastructureModels & Research

Key Points

  • The paper introduces ID-Eraser, a proactive defense against face swapping that targets identity information rather than relying on ineffective pixel-level perturbations.
  • ID-Eraser injects learnable perturbations into facial identity embeddings and uses a Face Revive Generator (FRG) to reconstruct visually natural-looking protective images.
  • Experiments in strict black-box settings show strong disruption of identity recognition and face swapping, including very low Top-1 accuracy (0.30) and strong perceptual/quality metrics (best FID 1.64, LPIPS 0.020).
  • The method reduces identity similarity of protected swaps to an average of 0.504 across multiple representative face swapping models and demonstrates cross-dataset generalization, distortion robustness, and effectiveness against commercial APIs (e.g., Tencent similarity 0.76 → 0.36).

Abstract

Deepfake technologies have rapidly advanced with modern generative AI, and face swapping in particular poses serious threats to privacy and digital security. Existing proactive defenses mostly rely on pixel-level perturbations, which are ineffective against contemporary swapping models that extract robust high-level identity embeddings. We propose ID-Eraser, a feature-space proactive defense that removes identifiable facial information to prevent malicious face swapping. By injecting learnable perturbations into identity embeddings and reconstructing natural-looking protection images through a Face Revive Generator (FRG), ID-Eraser produces visually realistic results for humans while rendering the protected identities unusable for Deepfake models. Experiments show that ID-Eraser substantially disrupts identity recognition across diverse face recognition and swapping systems under strict black-box settings, achieving the lowest Top-1 accuracy (0.30) with the best FID (1.64) and LPIPS (0.020). Compared with swaps generated from clean inputs, the identity similarity of protected swaps drops sharply to an average of 0.504 across five representative face swapping models. ID-Eraser further demonstrates strong cross-dataset generalization, robustness to common distortions, and practical effectiveness on commercial APIs, reducing Tencent API similarity from 0.76 to 0.36.