Entropy-informed Decoding: Adaptive Information-Driven Branching

📰 ArXiv cs.AI

arXiv:2605.09745v1 Announce Type: cross Abstract: Large language models (LLMs) achieve remarkable generative performance, yet their output quality is dependent on the decoding strategy. While sampling-based methods (e.g., top-k, nucleus) and search-and-select based methods (e.g., beam search, best-of-n, majority voting) can improve upon greedy decoding, both approaches suffer from limitations: sampling generally commits to a single path, while search often expends excessive computation regardles

Published 12 May 2026

Read full paper → ← Back to Reads