Two-Stage Optimizer-Aware Online Data Selection for Large Language Models
📰 ArXiv cs.AI
A two-stage, optimizer-aware framework for online data selection improves the fine-tuning efficiency of large language models
Action Steps
- Identify the limitations of existing gradient-based data selection methods, which operate offline: they score samples once before training and ignore how a sample's utility shifts across optimizer steps
- Develop a two-stage optimizer-aware framework for online data selection (a rough sketch follows this list)
- Implement the framework so it adapts to sequential data arrival and step-dependent sample utility
- Evaluate how much the framework improves fine-tuning efficiency
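The digest does not spell out the paper's exact algorithm, so the PyTorch sketch below is only an illustration of the two-stage idea: Stage 1 applies a cheap per-sample-loss filter to the incoming batch, and Stage 2 scores the survivors by how well each sample's gradient aligns with Adam's preconditioned update direction, making the score depend on the optimizer's current state. All names (`select_batch`, `adam_direction`, `flat_grad`) and both scoring rules are hypothetical stand-ins, not taken from the paper.

```python
import torch
import torch.nn.functional as F

def flat_grad(model, loss):
    """Flatten the gradient of `loss` w.r.t. all model parameters."""
    grads = torch.autograd.grad(loss, list(model.parameters()))
    return torch.cat([g.reshape(-1) for g in grads])

def adam_direction(model, optimizer, eps=1e-8):
    """Flatten Adam's current update direction, exp_avg / sqrt(exp_avg_sq).

    Requires at least one optimizer step so the moment estimates exist.
    """
    parts = []
    for p in model.parameters():
        state = optimizer.state[p]
        parts.append((state["exp_avg"] / (state["exp_avg_sq"].sqrt() + eps)).reshape(-1))
    return torch.cat(parts)

def select_batch(model, optimizer, x, y, keep_stage1, keep_stage2):
    # Stage 1 (cheap, gradient-free): keep the keep_stage1 highest-loss
    # candidates from the newly arrived batch.
    with torch.no_grad():
        losses = F.cross_entropy(model(x), y, reduction="none")
    survivors = losses.topk(keep_stage1).indices

    # Stage 2 (optimizer-aware): score each survivor by the cosine
    # similarity between its gradient and Adam's preconditioned update
    # direction; the score changes as the optimizer state evolves.
    direction = adam_direction(model, optimizer)
    scores = []
    for i in survivors:
        loss_i = F.cross_entropy(model(x[i : i + 1]), y[i : i + 1])
        scores.append(F.cosine_similarity(flat_grad(model, loss_i), direction, dim=0))
    top = torch.stack(scores).topk(keep_stage2).indices
    return survivors[top]

# Usage: toy model, one warm-up step to populate Adam's moment estimates,
# then selection over one simulated stream batch of 32 candidates.
model = torch.nn.Linear(16, 4)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x, y = torch.randn(32, 16), torch.randint(0, 4, (32,))
F.cross_entropy(model(x), y).backward()
opt.step(); opt.zero_grad()
chosen = select_batch(model, opt, x, y, keep_stage1=16, keep_stage2=8)
```

In an online loop this selection would run on every incoming batch, so the Stage 2 scores track the optimizer's evolving state rather than a one-time offline ranking.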
Who Needs to Know This
ML researchers and engineers fine-tuning large language models can apply this framework to prioritize high-utility training data and improve model performance at lower compute cost
Key Insight
💡 Optimizer-aware online data selection can significantly improve the efficiency of large language model fine-tuning, because a sample's utility depends on the optimizer's current state and changes from step to step
Share This
🚀 Improve LLM fine-tuning with two-stage optimizer-aware online data selection!
DeepCamp AI