Unlocking Proactivity in Task-Oriented Dialogue

arXiv cs.AI

arXiv:2605.22240v1

Abstract: Proactive task-oriented dialogue (TOD), such as outbound sales, demands a persuasive agent that actively probes the user's concerns and steers the conversation toward acceptance within a bounded number of turns. Yet post-trained LLMs are inherently conservative, and reward-shaping RL (e.g., GRPO) struggles since it only re-weights what an already passive policy samples. We show that conditioning on the user's latent concerns unlocks proactive capab

Published 23 May 2026
