Controllable Generative Video Compression
arXiv cs.CV / 4/9/2026
Key Points
- The paper introduces Controllable Generative Video Compression (CGVC), which aims to reconcile the usual tradeoff in perceptual video compression between perceptual realism and signal fidelity.
- CGVC encodes representative keyframes and uses them as structural priors for generating non-keyframes, while also coding dense per-frame control signals to preserve finer details, structure, and semantics.
- Non-keyframes are reconstructed via a controllable generative video model that enforces temporal and content consistency guided by the provided priors.
- To improve color recovery, the authors propose a color-distance-guided keyframe selection algorithm that adaptively chooses keyframes based on color similarity.
- Experiments indicate CGVC outperforms prior perceptual video compression approaches on both objective signal fidelity and perceptual quality metrics.
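
To make the keyframe selection idea concrete, here is a minimal sketch of one way a color-distance-guided selector could work. It is not the authors' algorithm: the summary does not say which color distance or selection rule the paper uses. The sketch assumes a greedy rule (start a new keyframe when a frame's mean RGB color drifts too far from the last keyframe), and the names `mean_color`, `color_distance`, `select_keyframes`, and the `threshold` parameter are all hypothetical.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each channel in 0..255
Frame = Sequence[Pixel]        # a frame flattened to a sequence of pixels


def mean_color(frame: Frame) -> Tuple[float, float, float]:
    """Average R, G, and B over all pixels of a frame."""
    n = len(frame)
    return (
        sum(p[0] for p in frame) / n,
        sum(p[1] for p in frame) / n,
        sum(p[2] for p in frame) / n,
    )


def color_distance(a: Frame, b: Frame) -> float:
    """Euclidean distance between the mean colors of two frames.

    Assumption: the paper's actual color distance is not given in the
    summary; mean-RGB distance is a simple stand-in.
    """
    ma, mb = mean_color(a), mean_color(b)
    return sum((x - y) ** 2 for x, y in zip(ma, mb)) ** 0.5


def select_keyframes(frames: Sequence[Frame], threshold: float = 30.0) -> List[int]:
    """Greedily pick keyframe indices (hypothetical selection rule).

    The first frame is always a keyframe. A later frame becomes a keyframe
    when its color distance to the most recent keyframe exceeds `threshold`.
    """
    if not frames:
        return []
    keyframes = [0]
    for i in range(1, len(frames)):
        if color_distance(frames[i], frames[keyframes[-1]]) > threshold:
            keyframes.append(i)
    return keyframes


if __name__ == "__main__":
    # Five tiny 4-pixel frames: two dark, two gray, one red.
    clip = [
        [(0, 0, 0)] * 4,
        [(5, 5, 5)] * 4,
        [(100, 100, 100)] * 4,
        [(105, 100, 100)] * 4,
        [(200, 0, 0)] * 4,
    ]
    print(select_keyframes(clip, threshold=30.0))
```

With a rule like this, frames between keyframes stay close in color to the keyframe they are generated from, which is the intuition behind using color similarity to improve color recovery. A real system would likely use a perceptual color space and a bitrate-aware selection budget.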