PatchWorld: Gradient-Free Optimization of Executable World Models

📰 ArXiv cs.AI

arXiv:2605.30880v1 Announce Type: cross Abstract: Text-agent environments are typically modeled as partially observable Markov decision processes (POMDPs), assuming that the simulator's latent state and transition dynamics are hidden from the agent. Yet little work has examined whether executable code can be induced to serve as a world model for prediction and planning under partial observability. We introduce PatchWorld, a gradient-free framework that turns offline trajectories into executable

Published 1 Jun 2026
Read full paper → ← Back to Reads