Draft-and-Target Sampling for Video Generation Policy
arXiv cs.CV / 3/17/2026
Opinion | Ideas & Deep Analysis | Models & Research
Key Points
- The paper proposes Draft-and-Target Sampling, a training-free diffusion inference paradigm to improve the efficiency of video generation policies.
- It introduces a self-play denoising approach with two complementary trajectories: draft sampling for fast global trajectory generation and target sampling for refinement with small steps.
- Token chunking and a progressive acceptance strategy further reduce redundant computation, yielding up to a 2.1x speedup on three benchmarks.
- The results show improved efficiency with minimal loss in success rate, and the authors have released their code publicly.
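
To make the draft/target idea concrete, here is a toy sketch in Python. It is not the authors' implementation: the summary gives no algorithmic details, so the "denoiser" (`euler_step`, an explicit Euler step on a simple linear ODE), the acceptance rule (maximum absolute difference below `tol`), and every name and parameter are assumptions made for illustration. The sketch only shows the general shape: a cheap large-step draft pass over all tokens, a small-step target pass that refines each chunk of tokens, and a growing set of accepted chunks that skip further refinement.

```python
import numpy as np


def euler_step(x, clean, rate, h):
    """Hypothetical stand-in for one denoiser call.

    Takes one explicit Euler step of dx/ds = rate * (clean - x),
    with a separate rate per token (row of x).
    """
    return x + h * rate[:, None] * (clean - x)


def draft_and_target(x0, clean, rate, n_draft=4, n_sub=8, chunk=4, tol=1e-2):
    """Toy draft-and-target loop with chunking and progressive acceptance.

    Returns the final tokens, the per-chunk acceptance flags, and the
    number of chunk-level denoiser evaluations spent.
    """
    n_tokens = x0.shape[0]
    chunks = [slice(i, min(i + chunk, n_tokens)) for i in range(0, n_tokens, chunk)]
    accepted = [False] * len(chunks)
    x = x0.copy()
    cost = 0
    big_h = 1.0 / n_draft

    for _ in range(n_draft):
        # Draft pass: one large step over every token.
        draft = euler_step(x, clean, rate, big_h)
        cost += len(chunks)

        for c, sl in enumerate(chunks):
            if accepted[c]:
                # Accepted chunks follow the draft and skip refinement.
                x[sl] = draft[sl]
                continue

            # Target pass: cover the same interval with n_sub small steps.
            refined = x[sl].copy()
            for _ in range(n_sub):
                refined = euler_step(refined, clean[sl], rate[sl], big_h / n_sub)
            cost += n_sub

            # Assumed acceptance rule: draft and target agree within tol.
            if np.max(np.abs(refined - draft[sl])) < tol:
                accepted[c] = True
            x[sl] = refined

    return x, accepted, cost


# Example: 8 tokens in 2 chunks; the first chunk changes slowly, the second quickly.
x0 = np.ones((8, 2))
clean = np.zeros((8, 2))
rate = np.array([0.1] * 4 + [3.0] * 4)

x_final, accepted, cost = draft_and_target(x0, clean, rate)
baseline_cost = 4 * 8 * 2  # small steps everywhere: n_draft * n_sub * number of chunks
print(accepted, cost, baseline_cost)
```

In this toy setup the slowly changing chunk passes the agreement check early and then follows the draft, while the fast-changing chunk keeps being refined. The saving therefore depends on how many chunks are accepted; if none are, the draft pass is pure overhead.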




