Towards Robust Content Watermarking Against Removal and Forgery Attacks

arXiv cs.CV / 4/9/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses vulnerabilities in existing content watermarking methods for text-to-image diffusion models, specifically resistance to removal and forgery attacks.
  • It proposes a new paradigm called Instance-Specific watermarking with Two-Sided detection (ISTS) that adjusts watermark injection time and patterns dynamically based on the semantics of user prompts.
  • The authors introduce a two-sided detection method intended to improve robustness of watermark detection under adversarial conditions.
  • Experimental results indicate that the proposed ISTS approach outperforms prior watermarking techniques when subjected to removal and forgery attacks.

Abstract

Generated contents have raised serious concerns about copyright protection, image provenance, and credit attribution. A potential solution for these problems is watermarking. Recently, content watermarking for text-to-image diffusion models has been studied extensively for its effective detection utility and robustness. However, these watermarking techniques are vulnerable to potential adversarial attacks, such as removal attacks and forgery attacks. In this paper, we build a novel watermarking paradigm called Instance-Specific watermarking with Two-Sided detection (ISTS) to resist removal and forgery attacks. Specifically, we introduce a strategy that dynamically controls the injection time and watermarking patterns based on the semantics of users' prompts. Furthermore, we propose a new two-sided detection approach to enhance robustness in watermark detection. Experiments have demonstrated the superiority of our watermarking against removal and forgery attacks.