LiveBench IF Leaderboard
Compare models on LiveBench IF benchmarks.
Last updated: May 22, 2025
Rank | Model | Score | Organization | License | Updated At |
---|---|---|---|---|---|
1 | o3-2025-04-16OpenAI | 86.17 | OpenAI | Proprietary | May 22, 2025 |
2 | o3OpenAI | 86.17 | OpenAI | Proprietary | May 22, 2025 |
3 | google-gemini-2.0-flashGoogle | 85.79 | Proprietary | May 18, 2025 | |
4 | google-gemini-2.0-flash-liveGoogle | 85.79 | Proprietary | May 18, 2025 | |
5 | o4-miniOpenAI | 84.96 | OpenAI | Proprietary | May 22, 2025 |
6 | google-gemini-2.5-pro-previewGoogle | 83.50 | Proprietary | May 18, 2025 | |
7 | claude-opus-4-20250514Anthropic | 80.74 | Anthropic | Proprietary | May 22, 2025 |
8 | google-gemini-2.5-flash-previewGoogle | 79.02 | Proprietary | May 18, 2025 | |
9 | claude-opus-4-0Anthropic | 78.38 | Anthropic | Proprietary | May 22, 2025 |
10 | claude-sonnet-4-20250514Anthropic | 77.25 | Anthropic | Proprietary | May 22, 2025 |
11 | claude-sonnet-4-0Anthropic | 77.25 | Anthropic | Proprietary | May 22, 2025 |
12 | gpt-4.1OpenAI | 77.05 | OpenAI | Proprietary | May 22, 2025 |
13 | google-gemini-2.0-flash-liteGoogle | 76.63 | Proprietary | May 18, 2025 | |
14 | claude-3-7-sonnet-20250219Anthropic | 76.49 | Anthropic | Proprietary | May 22, 2025 |
15 | claude-3-7-sonnet-latestAnthropic | 76.49 | Anthropic | Proprietary | May 22, 2025 |
16 | anthropic-claude-3-7-sonnetAnthropic | 76.49 | Anthropic | Proprietary | May 22, 2025 |
17 | chatgpt-4oOpenAI | 71.92 | OpenAI | Proprietary | May 22, 2025 |
18 | gpt-4oOpenAI | 71.92 | OpenAI | Proprietary | May 22, 2025 |
19 | gpt-4.1-miniOpenAI | 70.31 | OpenAI | Proprietary | May 22, 2025 |
20 | anthropic-claude-3-5-sonnetAnthropic | 69.30 | Anthropic | Proprietary | May 22, 2025 |
21 | claude-3-5-sonnet-20241022Anthropic | 69.30 | Anthropic | Proprietary | May 22, 2025 |
22 | claude-3-5-sonnet-20240620Anthropic | 69.30 | Anthropic | Proprietary | May 22, 2025 |
23 | claude-3-5-sonnet-latestAnthropic | 69.30 | Anthropic | Proprietary | May 22, 2025 |
24 | gpt-4o-2024-05-13OpenAI | 64.94 | OpenAI | Proprietary | May 22, 2025 |
25 | gpt-4o-2024-11-20OpenAI | 64.94 | OpenAI | Proprietary | May 22, 2025 |
26 | gpt-4o-2024-08-06OpenAI | 64.94 | OpenAI | Proprietary | May 22, 2025 |
27 | claude-3-5-haiku-20241022Anthropic | 61.88 | Anthropic | Proprietary | May 22, 2025 |
28 | anthropic-claude-3-5-haikuAnthropic | 61.88 | Anthropic | Proprietary | May 22, 2025 |
29 | claude-3-5-haiku-latestAnthropic | 61.88 | Anthropic | Proprietary | May 22, 2025 |
30 | gpt-4.1-nanoOpenAI | 57.54 | OpenAI | Proprietary | May 22, 2025 |
31 | gpt-4o-miniOpenAI | 56.80 | OpenAI | Proprietary | May 22, 2025 |