AI Navigate

Top-b: Entropic Regulation of Relative Probability Bands in Autoregressive Language Processes

arXiv cs.CL / 3/17/2026

📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper introduces Top-b, an adaptive decoding strategy that tunes the candidate set according to the model's instantaneous entropy to address the limitations of static Top-k/Top-p rules.
  • It models generation as a trajectory on a relative probability manifold and uses a dynamic bandwidth coefficient linked to Shannon entropy to regulate sampling.
  • The authors show that Top-b acts as a variance-minimizing operator on the tail distribution of the model, effectively smoothing the sampling process.
  • Empirical results on GPQA and GSM8K indicate that Top-b reduces generation entropy and inter-decoding variance while maintaining competitive reasoning accuracy, functioning as a self-regulating control mechanism for autoregressive generation.

Abstract

Probabilistic language generators are theoretically modeled as discrete stochastic processes, yet standard decoding strategies (Top-k, Top-p) impose static truncation rules that fail to accommodate the dynamic information density of natural language. This misalignment often forces a suboptimal trade-off: static bounds are either too restrictive for high-entropy creative generation or too permissive for low-entropy logical reasoning. In this work, we formalize the generation process as a trajectory through a relative probability manifold. We introduce Top-b (Adaptive Relative Band Sampling), a decoding strategy that regulates the candidate set via a dynamic bandwidth coefficient coupled strictly to the instantaneous Shannon entropy of the model's distribution. We provide a theoretical framework demonstrating that Top-b acts as a variance-minimizing operator on the tail distribution. Empirical validation on GPQA and GSM8K benchmarks indicates that Top-b significantly reduces generation entropy and inter-decoding variance while maintaining competitive reasoning accuracy, effectively approximating a self-regulating control system for autoregressive generation.