Query-Conditioned Test-Time Self-Training for Large Language Models

📰 ArXiv cs.AI

arXiv:2605.13369v1 Announce Type: cross Abstract: Large language models (LLMs) are typically deployed with fixed parameters, and their performance is often improved by allocating more computation at inference time. While such test-time scaling can be effective, it cannot correct model misconceptions or adapt the model to the specific structure of an individual query. Test-time optimization addresses this limitation by enabling parameter updates during inference, but existing approaches either re

Published 14 May 2026
Read full paper → ← Back to Reads