D5P4: Partition Determinantal Point Process for Diversity in Parallel Discrete Diffusion Decoding

arXiv cs.AI / 3/20/2026

Key Points

  • The work introduces a generalized beam-search framework for discrete diffusion decoding that generates candidates in parallel and supports modular beam-selection objectives.
  • It proposes D5P4, a diversity-focused method that performs MAP inference over a Determinantal Point Process to improve in-batch diversity.
  • Using a scalable greedy solver, D5P4 stays multi-GPU compatible and exposes an explicit trade-off between model probability and target diversity at near-zero compute overhead.
  • Experiments on free-form generation and question answering show that D5P4 improves diversity while maintaining competitive generation quality.
  • The results suggest a practical approach to achieving controllable diversity in diffusion-based text generation, expanding the applicability of discrete diffusion models.
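
For readers unfamiliar with DPPs, the trade-off between model probability and diversity can be illustrated with the textbook quality-diversity decomposition of a DPP kernel. This is a sketch under assumptions: the symbols q_i (a per-candidate quality score derived from model probability), S (a similarity matrix between candidates), and k (the beam size) are introduced here for illustration, and this summary does not confirm that the paper uses exactly this kernel.

```latex
% Assumed quality-diversity kernel over n candidates
L_{ij} = q_i \, S_{ij} \, q_j

% Determinant of the submatrix indexed by a subset Y factorizes
\det(L_Y) = \Big( \prod_{i \in Y} q_i^{2} \Big) \det(S_Y)

% MAP inference for a fixed beam size k
Y^{\ast} = \arg\max_{|Y| = k} \; \Big[ 2 \sum_{i \in Y} \log q_i + \log\det(S_Y) \Big]
```

The first term rewards candidates the model considers likely, while the log-determinant of S_Y shrinks toward negative infinity as selected candidates become more similar, so it penalizes redundant picks.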

Abstract

Discrete diffusion models are promising alternatives to autoregressive approaches for text generation, yet their decoding methods remain under-studied. Standard decoding methods for autoregressive models, such as beam search, do not directly apply to iterative denoising, and existing diffusion decoding techniques provide limited control over in-batch diversity. To bridge this gap, we introduce a generalized beam-search framework for discrete diffusion that generates candidates in parallel and supports modular beam-selection objectives. As a diversity-focused instantiation, we propose D5P4, which formulates the selection step as MAP inference over a Determinantal Point Process. Leveraging a scalable greedy solver, D5P4 maintains multi-GPU compatibility and enables an explicit trade-off between model probability and target diversity with near-zero compute overhead. Experiments on free-form generation and question answering demonstrate that D5P4 improves diversity over strong baselines while maintaining competitive generation quality.
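
To make the selection step concrete, the following is a minimal sketch of greedy MAP inference over a DPP, assuming a quality-diversity kernel built from candidate log-probabilities and candidate embeddings. The function name `greedy_dpp_select`, the `quality_weight` parameter, and the use of cosine similarity over embeddings are all hypothetical choices made for illustration; the abstract does not specify the kernel or the solver, and this naive version recomputes determinants rather than using the scalable solver the authors describe.

```python
import numpy as np


def greedy_dpp_select(log_probs, embeddings, k, quality_weight=1.0):
    """Greedily pick up to k candidate indices that approximately maximize det(L_Y).

    Hypothetical illustration only, not the paper's implementation.
    """
    log_probs = np.asarray(log_probs, dtype=float)
    emb = np.asarray(embeddings, dtype=float)

    # Unit-normalize rows so emb @ emb.T is a cosine-similarity Gram matrix (PSD).
    emb = emb / np.linalg.norm(emb, axis=1, keepdims=True)
    sim = emb @ emb.T

    # Quality scores: larger quality_weight favors model probability over diversity.
    # Subtracting the max avoids overflow and does not change the greedy choices.
    quality = np.exp(quality_weight * (log_probs - log_probs.max()))

    # Assumed kernel: L_ij = q_i * S_ij * q_j, plus a small jitter for stability.
    kernel = quality[:, None] * sim * quality[None, :]
    kernel += 1e-9 * np.eye(len(quality))

    selected = []
    for _ in range(k):
        best_idx, best_logdet = None, -np.inf
        for i in range(len(quality)):
            if i in selected:
                continue
            idx = selected + [i]
            # Log-determinant of the kernel restricted to the tentative subset.
            sign, logdet = np.linalg.slogdet(kernel[np.ix_(idx, idx)])
            if sign > 0 and logdet > best_logdet:
                best_idx, best_logdet = i, logdet
        if best_idx is None:
            break  # no remaining candidate yields a positive determinant
        selected.append(best_idx)
    return selected


# Toy batch: candidates 0 and 1 are near-duplicates, candidate 2 is orthogonal to 0.
toy_log_probs = [-1.0, -1.1, -2.0, -3.0]
toy_embeddings = [[1, 0], [1, 0.01], [0, 1], [0.7, 0.7]]
print(greedy_dpp_select(toy_log_probs, toy_embeddings, k=2))
```

In this toy batch, the default weight selects candidates 0 and 2, skipping the near-duplicate candidate 1 even though it has higher probability than candidate 2. Raising `quality_weight` to 10 shifts the choice to candidates 0 and 1, which illustrates how a single scalar can trade diversity against model probability.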