SPaCe: Unlocking Sample-Efficient Large Language Models Training With Self-Pace Curriculum Learning
📰 ArXiv cs.AI
arXiv:2508.05015v2 Announce Type: replace-cross Abstract: Large language models (LLMs) have shown strong reasoning capabilities when fine-tuned with reinforcement learning (RL). However, such methods require extensive data and compute, making them impractical under many realistic training budgets. Many existing pipelines sample training examples uniformly across steps or epochs, ignoring differences in difficulty, redundancy, and learning value, which slows learning and wastes computation. We pr
DeepCamp AI