Machine Unlearning for Masked Diffusion Language Models

📰 ArXiv cs.AI

arXiv:2605.18253v1 Announce Type: cross Abstract: Recent masked diffusion language models (MDLMs), such as LLaDA and Dream, have achieved performance comparable to autoregressive large language models. Unlike autoregressive models, which generate text sequentially, MDLMs generate text by iteratively denoising masked positions in parallel. During fine-tuning, MDLMs learn to recover responses from masked response states conditioned on a prompt, thereby shifting their predictions from a prompt-mask

Published 19 May 2026
Read full paper → ← Back to Reads