ResGuard: Enhancing Robustness Against Known Original Attacks in Deep Watermarking

arXiv cs.CV / 4/7/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper identifies a key weakness in deep learning-based watermarking using the END architecture: Known Original Attacks (KOA), where adversaries with multiple original-watermarked image pairs can suppress watermarks via targeted strategies.
  • It demonstrates that a simple residual-estimation and subtraction method using known pairs can nearly eliminate the watermark while keeping the image quality high, highlighting insufficient image dependency in residuals.
  • The authors attribute this vulnerability to END frameworks producing residuals that are too transferable across images rather than tightly coupled to each host image.
  • They propose ResGuard, a plug-and-play module that improves KOA robustness by enforcing image-dependent embedding through a residual specificity enhancement loss.
  • ResGuard also uses an auxiliary KOA noise layer during training to make decoders more reliable under embedding inconsistencies, boosting average watermark extraction accuracy from 59.87% to 99.81%.

Abstract

Deep learning-based image watermarking commonly adopts an "Encoder-Noise Layer-Decoder" (END) architecture to improve robustness against random channel distortions, yet it often overlooks intentional manipulations introduced by adversaries with additional knowledge. In this paper, we revisit this paradigm and expose a critical yet underexplored vulnerability: the Known Original Attack (KOA), where an adversary has access to multiple original-watermarked image pairs, enabling various targeted suppression strategies. We show that even a simple residual-based removal approach, namely estimating an embedding residual from known pairs and subtracting it from unseen watermarked images, can almost completely remove the watermark while preserving visual quality. This vulnerability stems from the insufficient image dependency of residuals produced by END frameworks, which makes them transferable across images. To address this, we propose ResGuard, a plug-and-play module that enhances KOA robustness by enforcing image-dependent embedding. Its core lies in a residual specificity enhancement loss, which encourages residuals to be tightly coupled with their host images and thus improves image dependency. Furthermore, an auxiliary KOA noise layer injects residual-style perturbations during training, allowing the decoder to remain reliable under stronger embedding inconsistencies. Integrated into existing frameworks, ResGuard boosts KOA robustness, improving average watermark extraction accuracy from 59.87% to 99.81%.