AIME 2025
MathHow to Run
Use OpenAI simple-evals framework or manually evaluate against AIME 2025 problems
Leaderboard
| Rank | Model | Provider | Parameters | Score |
|---|---|---|---|---|
| 1 | GPT-5.2 Thinking | OpenAI | Unknown | 100.0% |
| 2 | Gemini 3 Pro | Unknown | 95.0% | |
| 3 | Claude Opus 4.5 | Anthropic | Unknown | 93.0% |
| 4 | DeepSeek-R1 | DeepSeek | 671B MoE | 79.2% |