BFCL Leaderboard
Compare models on BFCL benchmarks.
Last updated: May 22, 2025
Rank | Model | Score | Organization | License | Updated At |
---|---|---|---|---|---|
1 | gpt-4o-2024-05-13OpenAI | 72.08% | OpenAI | Proprietary | May 22, 2025 |
2 | gpt-4o-2024-11-20OpenAI | 72.08% | OpenAI | Proprietary | May 22, 2025 |
3 | chatgpt-4oOpenAI | 72.08% | OpenAI | Proprietary | May 22, 2025 |
4 | gpt-4oOpenAI | 72.08% | OpenAI | Proprietary | May 22, 2025 |
5 | gpt-4o-2024-08-06OpenAI | 72.08% | OpenAI | Proprietary | May 22, 2025 |
6 | o1OpenAI | 67.87% | OpenAI | Proprietary | May 22, 2025 |
7 | gpt-4o-miniOpenAI | 65.12% | OpenAI | Proprietary | May 22, 2025 |
8 | o3-miniOpenAI | 65.12% | OpenAI | Proprietary | May 22, 2025 |
9 | google-gemini-2.0-flashGoogle | 60.42 | Proprietary | May 18, 2025 | |
10 | google-gemini-2.0-flash-liveGoogle | 60.42 | Proprietary | May 18, 2025 | |
11 | claude-3-7-sonnet-20250219Anthropic | 58.3% | Anthropic | Proprietary | May 22, 2025 |
12 | claude-3-7-sonnet-latestAnthropic | 58.3% | Anthropic | Proprietary | May 22, 2025 |
13 | anthropic-claude-3-7-sonnetAnthropic | 58.3% | Anthropic | Proprietary | May 22, 2025 |
14 | anthropic-claude-3-5-sonnetAnthropic | 56.46% | Anthropic | Proprietary | May 22, 2025 |
15 | claude-3-5-sonnet-20241022Anthropic | 56.46 % | Anthropic | Proprietary | May 22, 2025 |
16 | claude-3-5-sonnet-20240620Anthropic | 56.46% | Anthropic | Proprietary | May 22, 2025 |
17 | claude-3-5-sonnet-latestAnthropic | 56.46% | Anthropic | Proprietary | May 22, 2025 |
18 | claude-3-5-haiku-20241022Anthropic | 54.31% | Anthropic | Proprietary | May 22, 2025 |
19 | anthropic-claude-3-5-haikuAnthropic | 54.31% | Anthropic | Proprietary | May 22, 2025 |
20 | claude-3-5-haiku-latestAnthropic | 54.31% | Anthropic | Proprietary | May 22, 2025 |
21 | o1-miniOpenAI | 52.2% | OpenAI | Proprietary | May 22, 2025 |