Solving Physics Olympiad via Reinforcement Learning on Physics Simulators
📰 ArXiv cs.AI
arXiv:2604.11805v1 Announce Type: cross Abstract: We have witnessed remarkable advances in LLM reasoning capabilities with the advent of DeepSeek-R1. However, much of this progress has been fueled by the abundance of internet question-answer (QA) pairs, a major bottleneck going forward, since such data is limited in scale and concentrated mainly in domains like mathematics. In contrast, other sciences such as physics lack large-scale QA datasets to effectively train reasoning-capable models. In
DeepCamp AI