Towards Better Statistical Understanding of Watermarking LLMs
arXiv stat.ML / 4/8/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- This paper analyzes the trade-off in LLM watermarking between model distortion and detector effectiveness by formulating it as a constrained optimization problem around red-green list watermarking.
- It derives an analytical property of the optimal solution and uses it to motivate a new online dual gradient ascent watermarking algorithm.
- The authors prove the proposed method achieves asymptotic Pareto optimality, providing explicit guarantees that improve detection ability via increased green-list probability while controlling distortion.
- They compare and justify watermark distortion metrics, arguing for KL divergence and highlighting problems with prior “distortion-free” and perplexity-based criteria.
- Experiments on extensive datasets show the algorithm performs against benchmark watermarking approaches, supporting the practical value of the theoretical formulation.
Related Articles

Black Hat Asia
AI Business
Meta's latest model is as open as Zuckerberg's private school
The Register

AI fuels global trade growth as China-US flows shift, McKinsey finds
SCMP Tech
Why multi-agent AI security is broken (and the identity patterns that actually work)
Dev.to
BANKING77-77: New best of 94.61% on the official test set (+0.13pp) over our previous tests 94.48%.
Reddit r/artificial