LM Arena WebDev Leaderboard
Compare models on LM Arena WebDev benchmarks.
Rank | Model | Score | Organization | License |
---|---|---|---|---|
1 | Gemini 2.5 Pro Preview | 1420 | Google | Proprietary |
2 | Claude 3.7 Sonnet | 1357 | Anthropic | Proprietary |
3 | GPT-4.1 | 1257 | OpenAI | Proprietary |
4 | Claude 3.5 Sonnet | 1238 | Anthropic | Proprietary |
5 | o3-2025-04-16 | 1190 | OpenAI | Proprietary |
6 | o3 | 1190 | OpenAI | Proprietary |
7 | GPT-4.1 mini | 1185 | OpenAI | Proprietary |
8 | Gemini 2.5 Flash Preview | 1145 | Google | Proprietary |
9 | Claude 3.5 Haiku | 1133 | Anthropic | Proprietary |
10 | o4-mini | 1095 | OpenAI | Proprietary |
11 | o3-mini | 1092 | OpenAI | Proprietary |
12 | o1 | 1045 | OpenAI | Proprietary |
13 | o1-mini | 1042 | OpenAI | Proprietary |
14 | Gemini 2.0 Flash | 1039 | Google | Proprietary |
15 | Gemini 2.0 Flash | 1039 | Google | Proprietary |
16 | GPT-4o | 964 | OpenAI | Proprietary |
17 | ChatGPT-4o | 964 | OpenAI | Proprietary |
18 | GPT-4o | 964 | OpenAI | Proprietary |
19 | Gemini 1.5 Pro | 893 | Google | Proprietary |