Evaluating AI’s ability to perform scientific research tasks

📰 OpenAI News

OpenAI introduces FrontierScience to benchmark AI reasoning in physics, chemistry, and biology

advanced Published 16 Dec 2025

Action Steps

Explore the FrontierScience benchmark
Evaluate AI models using the benchmark
Compare results to measure progress in scientific research
Apply findings to improve AI reasoning in physics, chemistry, and biology

Who Needs to Know This

Researchers and AI engineers on a team benefit from FrontierScience as it helps measure progress toward real scientific research, and product managers can use it to evaluate AI capabilities

Key Insight

💡 FrontierScience helps measure AI progress in scientific research