Recursive Agent Optimization

📰 ArXiv cs.AI

arXiv:2605.06639v1 Announce Type: cross Abstract: We introduce Recursive Agent Optimization (RAO), a reinforcement learning approach for training recursive agents: agents that can spawn and delegate sub-tasks to new instantiations of themselves recursively. Recursive agents implement an inference-time scaling algorithm that naturally allows agents to scale to longer contexts and generalize to more difficult problems via divide-and-conquer. RAO provides a method to train models to best take advan

Published 9 May 2026

Read full paper → ← Back to Reads