Top-b: Entropic Regulation of Relative Probability Bands in Autoregressive Language Processes
arXiv cs.CL / 3/17/2026
📰 NewsSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper introduces Top-b, an adaptive decoding strategy that tunes the candidate set according to the model's instantaneous entropy to address the limitations of static Top-k/Top-p rules.
- It models generation as a trajectory on a relative probability manifold and uses a dynamic bandwidth coefficient linked to Shannon entropy to regulate sampling.
- The authors show that Top-b acts as a variance-minimizing operator on the tail distribution of the model, effectively smoothing the sampling process.
- Empirical results on GPQA and GSM8K indicate that Top-b reduces generation entropy and inter-decoding variance while maintaining competitive reasoning accuracy, functioning as a self-regulating control mechanism for autoregressive generation.




