AI Benchmark
Benchmarks
Models
Cost
Compare
About
Benchmarks
All
Coding
Japanese
Knowledge
Math
Overall
Reasoning
Vision
Knowledge
MMLU-Pro
Harder version of MMLU with 10 answer choices.
Metrics: Accuracy (%)
Paper
Dataset