Block-wise Codeword Embedding for Reliable Multi-bit Text Watermarking
arXiv cs.CL / 5/4/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that existing multi-bit text watermarking approaches for LLMs overemphasize capacity and improperly conflate decoding with detection, leading to unreliable verification.
- It shows that ECC-based watermark extractors can produce catastrophic false positive rates, and that simple rejection-threshold tweaks often reduce true positive rates to near-random guessing.
- The authors propose BREW (Block-wise Reliable Embedding for Watermarking), a framework based on “designated verification” rather than decoding-centric extraction.
- BREW uses a two-stage process: blind message estimation via independent block voting and window-shifting verification that checks the embedded payload against local edits.
- Experiments report strong reliability (TPR 0.965, FPR 0.02) under up to 10% synonym substitution and claim the approach is model-agnostic and scalable for forensic use.
Related Articles

When Claims Freeze Because a Provider Record Drifted: The Case for Enrollment Repair Agents
Dev.to

The Cash Is Already Earned: Why Construction Pay Application Exceptions Fit an Agent Better Than SaaS
Dev.to

Why Ship-and-Debit Claim Recovery Is a Better Agent Wedge Than Another “AI Back Office” Tool
Dev.to
AI is getting better at doing things, but still bad at deciding what to do?
Reddit r/artificial

I Built an AI-Powered Chinese BaZi (八字) Fortune Teller — Here's What DeepSeek Revealed About Destiny
Dev.to