AI Benchmark
Benchmarks
Categories: Coding, Japanese, Knowledge, Math, Overall, Reasoning, Vision
Math
AIME 2025
American Invitational Mathematics Examination 2025.
Metrics: Accuracy (%)
Math
FrontierMath
Cutting-edge mathematics problems (Tiers 1-3).
Metrics: Accuracy (%)