Latent Action Reparameterization for Efficient Agent Inference

📰 ArXiv cs.AI

arXiv:2605.18597v2 Announce Type: new Abstract: Large language model (LLM) agents often rely on long sequences of low-level textual actions, resulting in large effective decision horizons and high inference cost. While prior work has focused on improving inference efficiency through system-level optimizations or prompt engineering, we argue that a key bottleneck lies in the representation of the action space itself. We propose Latent Action Reparameterization (LAR), a framework that learns a com

Published 19 May 2026

Read full paper → ← Back to Reads