Gaussian Shannon: High-Precision Diffusion Model Watermarking Based on Communication

arXiv cs.CV / 3/30/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The paper introduces “Gaussian Shannon,” a diffusion-model watermarking framework that models the diffusion process as a noisy communication channel to enable both robust tracing and exact bit-level recovery of embedded watermark payloads.
  • Unlike threshold-based detectors that only allow fuzzy matching, the method embeds watermarks directly into the initial Gaussian noise and aims to recover structured metadata in a bit-exact, lossless way for use cases like licensing instructions.
  • It analyzes interference as local bit flips and global stochastic distortions, then applies a cascaded defense using error-correcting codes plus majority voting for reliable end-to-end payload transmission.
  • Experiments on multiple Stable Diffusion variants and diverse perturbations report state-of-the-art bit accuracy and high true positive rates while maintaining image quality without fine-tuning.
  • The authors provide an open-source implementation via the linked GitHub repository.

Abstract

Diffusion models generate high-quality images but pose serious risks like copyright violation and disinformation. Watermarking is a key defense for tracing and authenticating AI-generated content. However, existing methods rely on threshold-based detection, which only supports fuzzy matching and cannot recover structured watermark data bit-exactly, making them unsuitable for offline verification or applications requiring lossless metadata (e.g., licensing instructions). To address this problem, in this paper, we propose Gaussian Shannon, a watermarking framework that treats the diffusion process as a noisy communication channel and enables both robust tracing and exact bit recovery. Our method embeds watermarks in the initial Gaussian noise without fine-tuning or quality loss. We identify two types of channel interference, namely local bit flips and global stochastic distortions, and design a cascaded defense combining error-correcting codes and majority voting. This ensures reliable end-to-end transmission of semantic payloads. Experiments across three Stable Diffusion variants and seven perturbation types show that Gaussian Shannon achieves state-of-the-art bit-level accuracy while maintaining a high true positive rate, enabling trustworthy rights attribution in real-world deployment. The source code have been made available at: https://github.com/Rambo-Yi/Gaussian-Shannon