Math 500 Leaderboard
Compare models on the Math 500 benchmark.
Last updated: May 22, 2025
Rank | Model | Score | Organization | License | Updated At |
---|---|---|---|---|---|
1 | gpt-4o-mini | 97.9% | OpenAI | Proprietary | May 22, 2025 |
2 | o3-mini | 97.9% | OpenAI | Proprietary | May 22, 2025 |
3 | o1 | 96.4% | OpenAI | Proprietary | May 22, 2025 |
4 | o1-mini | 90% | OpenAI | Proprietary | May 22, 2025 |
5 | google-gemini-2.0-flash | 89.7% | Google | Proprietary | May 18, 2025 |
6 | google-gemini-2.0-flash-live | 89.7% | Google | Proprietary | May 18, 2025 |
7 | claude-3-7-sonnet-20250219 | 82.2% | Anthropic | Proprietary | May 22, 2025 |
8 | claude-3-7-sonnet-latest | 82.2% | Anthropic | Proprietary | May 22, 2025 |
9 | anthropic-claude-3-7-sonnet | 82.2% | Anthropic | Proprietary | May 22, 2025 |
10 | anthropic-claude-3-5-sonnet | 78% | Anthropic | Proprietary | May 22, 2025 |
11 | claude-3-5-sonnet-20241022 | 78% | Anthropic | Proprietary | May 22, 2025 |
12 | claude-3-5-sonnet-20240620 | 78% | Anthropic | Proprietary | May 22, 2025 |
13 | claude-3-5-sonnet-latest | 78% | Anthropic | Proprietary | May 22, 2025 |
14 | claude-3-5-haiku-20241022 | 69.4% | Anthropic | Proprietary | May 22, 2025 |
15 | anthropic-claude-3-5-haiku | 69.4% | Anthropic | Proprietary | May 22, 2025 |
16 | claude-3-5-haiku-latest | 69.4% | Anthropic | Proprietary | May 22, 2025 |
17 | gpt-4o-2024-05-13 | 60.3% | OpenAI | Proprietary | May 22, 2025 |
18 | gpt-4o-2024-11-20 | 60.3% | OpenAI | Proprietary | May 22, 2025 |
19 | chatgpt-4o | 60.3% | OpenAI | Proprietary | May 22, 2025 |
20 | gpt-4o | 60.3% | OpenAI | Proprietary | May 22, 2025 |
21 | gpt-4o-2024-08-06 | 60.3% | OpenAI | Proprietary | May 22, 2025 |