Diffusion Language Models for Speech Recognition

📰 arXiv cs.AI

arXiv:2604.14001v1 Announce Type: cross

Abstract: Diffusion language models have recently emerged as a leading alternative to standard language models, owing to their capacity for bidirectional attention and parallel text generation. In this work, we explore variants for their use in speech recognition. Specifically, we introduce a comprehensive guide to incorporating masked diffusion language models (MDLM) and uniform-state diffusion models (USDMs) for rescoring ASR hypotheses. Additionally, we de

Published 16 Apr 2026
