ANCORA: Learning to Question via Manifold-Anchored Self-Play for Verifiable Reasoning

📰 arXiv cs.AI

arXiv:2604.27644v1 (announce type: cross)

Abstract: We propose a paradigm shift from learning to answer to learning to question: can a language model generate verifiable problems, solve them, and turn the resulting feedback into self-improvement without human supervision? We introduce ANCORA, an anchored-curriculum framework in which a unified policy alternates between a Proposer that synthesizes novel specifications and a Solver that produces verified solutions. ANCORA rests on three load-bearing

Published 1 May 2026
