The Shift to LLM Post-Training... What's Actually Driving It?
A short discussion with Zichen Liu, author of the Dr. GRPO paper, about what motivated him to switch his research focus to LLM post-training.
#LLM #ReinforcementLearning #PostTraining
DeepCamp AI