AI Navigate

SLICE: Semantic Latent Injection via Compartmentalized Embedding for Image Watermarking

arXiv cs.CV / 3/16/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • SLICE introduces semantic latent injection by compartmentalizing embedding into four factors—subject, environment, action, and detail—that bind to distinct regions of the initial Gaussian noise in diffusion models.
  • This fine-grained binding enables robust watermark verification with localizable tampering detection, addressing vulnerabilities of single global semantic bindings.
  • The method provides statistical guarantees on false-accept rates and outperforms existing baselines against advanced semantic-guided regeneration attacks, while preserving image quality and semantic fidelity.
  • It offers a training-free, practical provenance solution for reliable tamper localization in diffusion-based image generation.

Abstract

Watermarking the initial noise of diffusion models has emerged as a promising approach for image provenance, but content-independent noise patterns can be forged via inversion and regeneration attacks. Recent semantic-aware watermarking methods improve robustness by conditioning verification on image semantics. However, their reliance on a single global semantic binding makes them vulnerable to localized but globally coherent semantic edits. To address this limitation and provide a trustworthy semantic-aware watermark, we propose \underline{\textbf{S}}emantic \underline{\textbf{L}}atent \underline{\textbf{I}}njection via \underline{\textbf{C}}ompartmentalized \underline{\textbf{E}}mbedding (\textbf{SLICE}). Our framework decouples image semantics into four semantic factors (subject, environment, action, and detail) and precisely anchors them to distinct regions in the initial Gaussian noise. This fine-grained semantic binding enables advanced watermark verification where semantic tampering is detectable and localizable. We theoretically justify why SLICE enables robust and reliable tamper localization and provides statistical guarantees on false-accept rates. Experimental results demonstrate that SLICE significantly outperforms existing baselines against advanced semantic-guided regeneration attacks, substantially reducing attack success while preserving image quality and semantic fidelity. Overall, SLICE offers a practical, training-free provenance solution that is both fine-grained in diagnosis and robust to realistic adversarial manipulations.