Gaussian Shannon: High-Precision Diffusion Model Watermarking Based on Communication

arXiv cs.CV / 3/30/2026

💬 OpinionIdeas & Deep AnalysisTools & Practical UsageModels & Research

共有:

Key Points

The paper introduces “Gaussian Shannon,” a diffusion-model watermarking framework that models the diffusion process as a noisy communication channel to enable both robust tracing and exact bit-level recovery of embedded watermark payloads.
Unlike threshold-based detectors that only allow fuzzy matching, the method embeds watermarks directly into the initial Gaussian noise and aims to recover structured metadata in a bit-exact, lossless way for use cases like licensing instructions.
It analyzes interference as local bit flips and global stochastic distortions, then applies a cascaded defense using error-correcting codes plus majority voting for reliable end-to-end payload transmission.
Experiments on multiple Stable Diffusion variants and diverse perturbations report state-of-the-art bit accuracy and high true positive rates while maintaining image quality without fine-tuning.
The authors provide an open-source implementation via the linked GitHub repository.

Abstract

Diffusion models generate high-quality images but pose serious risks like copyright violation and disinformation. Watermarking is a key defense for tracing and authenticating AI-generated content. However, existing methods rely on threshold-based detection, which only supports fuzzy matching and cannot recover structured watermark data bit-exactly, making them unsuitable for offline verification or applications requiring lossless metadata (e.g., licensing instructions). To address this problem, in this paper, we propose Gaussian Shannon, a watermarking framework that treats the diffusion process as a noisy communication channel and enables both robust tracing and exact bit recovery. Our method embeds watermarks in the initial Gaussian noise without fine-tuning or quality loss. We identify two types of channel interference, namely local bit flips and global stochastic distortions, and design a cascaded defense combining error-correcting codes and majority voting. This ensures reliable end-to-end transmission of semantic payloads. Experiments across three Stable Diffusion variants and seven perturbation types show that Gaussian Shannon achieves state-of-the-art bit-level accuracy while maintaining a high true positive rate, enabling trustworthy rights attribution in real-world deployment. The source code have been made available at: https://github.com/Rambo-Yi/Gaussian-Shannon

Black Hat Asia

AI Business

Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer

Simon Willison's Blog

Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026

Dev.to

I missed the "fun" part in software development

Dev.to

The Billion Dollar Tax on AI Agents

Dev.to

Gaussian Shannon: High-Precision Diffusion Model Watermarking Based on Communication

Key Points

Abstract

Related Articles

Black Hat Asia

Mr. Chatterbox is a (weak) Victorian-era ethically trained model you can run on your own computer

Beyond the Chatbot: Engineering Multi-Agent Ecosystems in 2026

I missed the "fun" part in software development

The Billion Dollar Tax on AI Agents

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer