CLORE: Content-Level Optimization for Reasoning Efficiency

📰 ArXiv cs.AI

arXiv:2605.22211v1 Announce Type: new Abstract: Reinforcement learning post-training has improved the reasoning ability of large language models, but often produces unnecessarily long, repetitive, or semantically opaque reasoning traces. Existing efficient reasoning methods mainly regulate response length through explicit budgets or length-aware rewards, leaving intermediate reasoning content weakly supervised. We propose CLORE, a content-level optimization framework that improves reasoning effi

Published 23 May 2026

Read full paper → ← Back to Reads