ARC-AGI-2
ReasoningHow to Run
pip install arckit && Download tasks from arcprize.org && python evaluate.py
Leaderboard
| Rank | Model | Provider | Parameters | Score |
|---|---|---|---|---|
| 1 | GPT-5.2 | OpenAI | Unknown | 52.9% |
| 2 | Gemini 3 Deep Think | Unknown | 45.1% | |
| 3 | Claude Opus 4.5 | Anthropic | Unknown | 37.6% |
| 4 | Gemini 3 Pro | Unknown | 31.1% |