AIME 2024 Leaderboard
Compare models on AIME 2024 benchmarks.
Rank | Model | Score | Organization | License |
---|---|---|---|---|
1 | o4-miniOpenAI | 93.4% | OpenAI | Proprietary |
2 | Gemini 2.5 Pro PreviewGoogle | 92 | Proprietary | |
3 | o3-2025-04-16OpenAI | 91.6% | OpenAI | Proprietary |
4 | o3OpenAI | 91.6% | OpenAI | Proprietary |
5 | Gemini 2.5 Flash PreviewGoogle | 88 | Proprietary | |
6 | GPT-4o miniOpenAI | 87.3% | OpenAI | Proprietary |
7 | o3-miniOpenAI | 87.3% | OpenAI | Proprietary |
8 | o1OpenAI | 79.2% | OpenAI | Proprietary |
9 | o1-miniOpenAI | 63.6% | OpenAI | Proprietary |
10 | GPT-4.1 miniOpenAI | 49.6% | OpenAI | Proprietary |
11 | GPT-4.1OpenAI | 48.1% | OpenAI | Proprietary |
12 | GPT-4.1 nanoOpenAI | 29.4 | OpenAI | Proprietary |
13 | Claude 3.7 SonnetAnthropic | 23.3% | Anthropic | Proprietary |
14 | Claude 3.5 SonnetAnthropic | 16% | Anthropic | Proprietary |
15 | GPT-4oOpenAI | 13.4% | OpenAI | Proprietary |
16 | GPT-4oOpenAI | 13.4% | OpenAI | Proprietary |
17 | ChatGPT-4oOpenAI | 13.4% | OpenAI | Proprietary |
18 | GPT-4oOpenAI | 13.4% | OpenAI | Proprietary |
19 | GPT-4oOpenAI | 13.4% | OpenAI | Proprietary |