BFCL Leaderboard
Compare models on BFCL benchmarks.
Rank | Model | Score | Organization | License |
---|---|---|---|---|
1 | GPT-4o 2024-05-13OpenAI | 72.08 | OpenAI | Proprietary |
2 | GPT-4oOpenAI | 72.08 | OpenAI | Proprietary |
3 | GPT-4oOpenAI | 72.08 | OpenAI | Proprietary |
4 | GPT-4oOpenAI | 72.08% | OpenAI | Proprietary |
5 | o1OpenAI | 67.87 | OpenAI | Proprietary |
6 | GPT-4o miniOpenAI | 65.12% | OpenAI | Proprietary |
7 | o3-miniOpenAI | 65.12 | OpenAI | Proprietary |
8 | Gemini 2.0 FlashGoogle | 60.42 | Proprietary | |
9 | Gemini 2.0 FlashGoogle | 60.42 | Proprietary | |
10 | Claude 3.7 SonnetAnthropic | 58.3 | Anthropic | Proprietary |
11 | Claude 3.5 SonnetAnthropic | 56.46 | Anthropic | Proprietary |
12 | Claude 3.5 HaikuAnthropic | 54.31 | Anthropic | Proprietary |
13 | o3OpenAI | n/a | OpenAI | Proprietary |
14 | Claude 3 HaikuAnthropic | 54.31 | Anthropic | Proprietary |
15 | o1-miniOpenAI | 52.2 | OpenAI | Proprietary |