CtrlAttack: A Unified Attack on World-Model Control in Diffusion Models
arXiv cs.CV / 3/17/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper analyzes the vulnerability of diffusion-based image-to-video models by examining their world-model-like temporal dynamics and identifies trajectory-control as a new attack surface.
- It proposes CtrlAttack, which represents perturbations as a low-dimensional velocity field and creates a continuous displacement field via temporal integration to disrupt state evolution while preserving temporal coherence, usable in both white-box and black-box settings.
- Experiments show high attack success rates (over 90% in white-box and over 80% in black-box) with limited degradation to perceptual metrics (FID and FVD changes within 6 and 130, respectively).
- The work reveals security risks at the level of state dynamics in I2V models and calls for defenses to mitigate trajectory-level attacks.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles
The Moonwell Oracle Exploit: How AI-Assisted 'Vibe Coding' Turned cbETH Into a $1.12 Token and Cost $1.78M
Dev.to
How CVE-2026-25253 exposed every OpenClaw user to RCE — and how to fix it in one command
Dev.to
Day 10: An AI Agent's Revenue Report — $29, 25 Products, 160 Tweets
Dev.to
Does Synthetic Data Generation of LLMs Help Clinical Text Mining?
Dev.to
What CVE-2026-25253 Taught Me About Building Safe AI Assistants
Dev.to