CSGuard: Toward Forgery-Resistant Watermarking in Diffusion Models via Compressed Sensing Constraint

arXiv cs.CV / 5/5/2026

📰 NewsModels & Research

Key Points

  • Latent-space watermarking for diffusion models can attribute content without extra training, but current schemes are vulnerable to forgery attacks that extract the watermark via inversion and re-generation with arbitrary prompts.
  • The paper introduces CSGuard, a forgery-resistant watermarking scheme that uses a compressed sensing constraint tied to a secret matrix to link generation and verification.
  • With CSGuard, only users who possess the secret matrix can correctly embed or verify watermarks, making it harder for unauthorized users to forge watermarked content.
  • Experiments report a drop in forgery attack success from 100.0% to 28.12%, while maintaining a 100% detection rate for benign watermarked images without degrading watermarking effectiveness.

Abstract

Latent-based diffusion model watermarking embeds watermarks into generated images' latent space to enable content attribution, offering a training-free solution for intellectual property protection and digital forensics. However, these methods exhibit a critical vulnerability to the forgery attack, attackers can extract the watermark by inverting the watermarked image and re-generating it with an arbitrary prompt, thereby enabling false attribution on malicious content. In this paper, we propose the CSGuard, the first forgery-resistant watermarking schema that leverages compressed sensing to bind the watermarked image generation and verification to a secret matrix. This ensures that only users possessing the secret matrix can correctly embed or verify the image watermark, prevents the illegal users from forgery without compromising generation quality and watermark integrity. Experimental results demonstrate that CSGuard achieves strong forgery resistance, reduces the attack success rate from 100.0\% to 28.12\%, and achieve 100\% detection rate on benign watermarked images without compromising watermarking effectiveness.