SJD-VP: Speculative Jacobi Decoding with Verification Prediction for Autoregressive Image Generation

arXiv cs.CV / 3/31/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper analyzes why Speculative Jacobi Decoding (SJD) can suffer from low acceptance rates, attributing the issue to ambiguity in speculative token selection.
  • It identifies a key observation: tokens whose probabilities increase across decoding iterations are more likely to be verification-accepted and correct.
  • Building on this, the authors propose Speculative Jacobi Decoding with Verification Prediction (SJD-VP), which uses probability changes over iterations to guide sampling toward tokens likely to pass verification.
  • SJD-VP is presented as plug-and-play, allowing easy integration with existing SJD methods without major redesign.
  • Experiments on standard benchmarks show SJD-VP both accelerates autoregressive decoding and improves image generation quality.

Abstract

Speculative Jacobi Decoding (SJD) has emerged as a promising method for accelerating autoregressive image generation. Despite its potential, existing SJD approaches often suffer from the low acceptance rate issue of speculative tokens due to token selection ambiguity. Recent works attempt to mitigate this issue primarily from the relaxed token verification perspective but fail to fully exploit the iterative dynamics of decoding. In this paper, we conduct an in-depth analysis and make a novel observation that tokens whose probabilities increase are more likely to match the verification-accepted and correct token. Based on this, we propose a novel Speculative Jacobi Decoding with Verification Prediction (SJD-VP). The key idea is to leverage the change in token probabilities across iterations to guide sampling, favoring tokens whose probabilities increase. This effectively predicts which tokens are likely to pass subsequent verification, boosting the acceptance rate. In particular, our SJD-VP is plug-and-play and can be seamlessly integrated into existing SJD methods. Extensive experiments on standard benchmarks demonstrate that our SJD-VP method consistently accelerates autoregressive decoding while improving image generation quality.