Differences in Text Generated by Diffusion and Autoregressive Language Models
📰 ArXiv cs.AI
arXiv:2605.12522v1 Announce Type: cross Abstract: Diffusion language models (DLMs) are promising alternatives to autoregressive language models (ARMs), yet the intrinsic differences in their generated text remain underexplored. We first find empirically that off-the-shelf DLMs exhibit lower $n$-gram entropy, higher semantic coherence, and higher semantic diversity. To understand the cause, we conduct controlled experiments that decouple the effects of training objectives and decoding algorithms.
DeepCamp AI