AI Model Arena
43 frontier AI models scored across 8 dimensions — reasoning, coding, math, multimodal, speed, cost, context, safety. Side-by-side comparison and dynamic ranking. Updated April 2026.
Score Weights:Reasoning 20%Coding 20%Math 15%Multimodal 15%Speed 10%Cost Efficiency 10%Context 5%Safety 5%
42 models tracked · Auto-generated badges show the leader in each category.
| # | Model | Total | Compare |
|---|---|---|---|
| 🥇 | Gemini 3.1 Pro🇺🇸Best OverallBest Multimodal Google DeepMind · 2026-04 | 92.6 | |
| 🥈 | DeepSeek V4🇨🇳Best Value DeepSeek · 2026-04 | 91.6 | |
| 🥉 | Gemini 2.5 Pro🇺🇸 Google DeepMind · 2026-01 | 91.3 | |
| 4 | GPT-5.5🇺🇸Best Reasoning OpenAI · 2026-04 | 91.1 | |
| 5 | Claude Sonnet 4.6🇺🇸 Anthropic · 2026-01 | 91.1 | |
| 6 | Claude Opus 4.7🇺🇸Best CodingSafest Anthropic · 2026-03 | 90.3 | |
| 7 | Qwen3.5 Max🇨🇳 Alibaba · 2026-03 | 90.1 | |
| 8 | GPT-5🇺🇸 OpenAI · 2025-12 | 90.1 | |
| 9 | Llama 4.5🇺🇸 Meta · 2026-03 | 89.7 | |
| 10 | DeepSeek R2🇨🇳 DeepSeek · 2026-02 | 89.4 | |
| 11 | GPT-4.1🇺🇸 OpenAI · 2025-04 | 89.1 | |
| 12 | Grok 5🇺🇸 xAI · 2026-03 | 88.6 | |
| 13 | DeepSeek V3.5🇨🇳 DeepSeek · 2026-02 | 88.4 | |
| 14 | Claude 3.7 Sonnet🇺🇸 Anthropic · 2025-02 | 88.2 | |
| 15 | Qwen3 Max🇨🇳 Alibaba · 2025-09 | 87.8 | |
| 16 | Gemini 2.0 Flash🇺🇸Fastest Google DeepMind · 2025-02 | 87.5 | |
| 17 | GLM-5🇨🇳 Zhipu AI · 2026-02 | 87.4 | |
| 18 | Mistral Large 3🇫🇷 Mistral AI · 2026-02 | 87.2 | |
| 19 | o4🇺🇸Best Math OpenAI · 2026-02 | 87.0 | |
| 20 | Claude Haiku 4.5🇺🇸 Anthropic · 2025-10 | 87.0 | |
| 21 | Kimi K2.5🇨🇳 Moonshot AI · 2026-03 | 86.9 | |
| 22 | GPT-4o🇺🇸 OpenAI · 2024-05 | 86.8 | |
| 23 | Grok 4🇺🇸 xAI · 2025-11 | 86.7 | |
| 24 | Llama 4🇺🇸 Meta · 2025-04 | 86.7 | |
| 25 | Gemini 1.5 Pro🇺🇸 Google DeepMind · 2024-05 | 85.9 | |
| 26 | o3🇺🇸 OpenAI · 2025-04 | 85.7 | |
| 27 | o3-mini🇺🇸 OpenAI · 2025-01 | 85.6 | |
| 28 | DeepSeek R1🇨🇳 DeepSeek · 2025-01 | 85.4 | |
| 29 | GLM-4.5🇨🇳 Zhipu AI · 2025-07 | 85.1 | |
| 30 | Doubao Pro🇨🇳 ByteDance · 2024-12 | 84.3 | |
| 31 | Kimi K2🇨🇳 Moonshot AI · 2025-08 | 84.3 | |
| 32 | ERNIE 5🇨🇳 Baidu · 2025-11 | 84.1 | |
| 33 | Yi-Lightning🇨🇳 01.AI · 2024-10 | 83.3 | |
| 34 | Mistral Large 2🇫🇷 Mistral AI · 2024-07 | 82.8 | |
| 35 | Amazon Nova Pro🇺🇸 Amazon · 2024-12 | 82.5 | |
| 36 | MiniMax Text-01🇨🇳 MiniMax · 2025-01 | 82.1 | |
| 37 | Phi-4🇺🇸 Microsoft · 2024-12 | 80.4 | |
| 38 | Sonar Large🇺🇸 Perplexity · 2025-01 | 80.1 | |
| 39 | Command R+🇨🇦 Cohere · 2024-08 | 79.8 | |
| 40 | Qwen2.5-Coder 32B🇨🇳 Alibaba · 2024-11 | 79.8 | |
| 41 | Llama 3.3 70B🇺🇸 Meta · 2024-12 | 79.5 | |
| 42 | Codestral 2🇫🇷 Mistral AI · 2025-01 | 79.0 |