Differences in Text Generated by Diffusion and Autoregressive Language Models

📰 ArXiv cs.AI

arXiv:2605.12522v1 Announce Type: cross Abstract: Diffusion language models (DLMs) are promising alternatives to autoregressive language models (ARMs), yet the intrinsic differences in their generated text remain underexplored. We first find empirically that off-the-shelf DLMs exhibit lower $n$-gram entropy, higher semantic coherence, and higher semantic diversity. To understand the cause, we conduct controlled experiments that decouple the effects of training objectives and decoding algorithms.

Published 14 May 2026

Read full paper → ← Back to Reads