Best AI Models for Claude Code

CLI Agent

Claude Code is Anthropic's CLI agent for software engineering. It excels at multi-file editing, test writing, and complex refactoring, and it benefits from models with large context windows and strong reasoning.

Last updated: 56m ago

Gemini 3.5 Flash (Google): Output $/M $9.00, Arena Elo 1479

Claude Opus 4.7 (Fast) (Anthropic): Output $/M $150.00

MiMo-V2.5-Pro (Xiaomi): Output $/M $0.870, Arena Elo 1465

What Matters for Claude Code

Function Calling, Reasoning, Streaming, Large Context, Strong Coding

Best Models for Claude Code

Chart: top 15 models by tool-optimized score (source: LMMarketCap.com)

All Models Ranked for Claude Code (331 models)

Scored by: benchmark performance (90%) from MMLU, GPQA, HumanEval, SWE-bench, and 15+ standardized evaluations, with capabilities and context as tiebreakers (10%).
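The 90/10 weighting described above can be sketched as follows. This is a minimal illustration, not the site's actual formula: the function name `tool_score`, the 0-100 benchmark scale, the even split of the 10% share between capabilities and context, and the 1,000,000-token cap are all assumptions made for the example.

```python
def tool_score(benchmark: float, capability_match: float, context_tokens: int,
               max_context_tokens: int = 1_000_000) -> float:
    """Hypothetical blend: 90% benchmark performance, 10% capabilities and context.

    benchmark: aggregated benchmark score on an assumed 0-100 scale.
    capability_match: fraction of required capabilities supported, in [0, 1].
    context_tokens: context window size, capped at max_context_tokens (assumed cap).
    """
    context_share = min(context_tokens, max_context_tokens) / max_context_tokens
    # Assumption: capabilities and context each contribute half of the 10% share.
    tiebreaker = 100 * (0.5 * capability_match + 0.5 * context_share)
    return 0.9 * benchmark + 0.1 * tiebreaker
```

With a weighting like this, two models with the same benchmark score are separated only by the small tiebreaker term, which matches the description of capabilities and context as tiebreakers.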

#	Model	Benchmark highlight	Provider	Score	Coding	Capabilities	Output $/M	Context (tokens)
1	Gemini 3.5 Flash	Arena Elo: 1479	Google	86	97	100%	$9.00	1.0M
2	Claude Opus 4.7 (Fast)	-	Anthropic	85	95	100%	$150.00	1.0M
3	MiMo-V2.5-Pro	Arena Elo: 1465	Xiaomi	85	94	100%	$0.870	1.0M
4	GLM 5.1	Arena Elo: 1474	Zhipu AI	85	96	100%	$3.08	203K
5	Qwen3.6 Max Preview	Arena Elo: 1459	Alibaba	84	93	100%	$6.24	262K
6	Kimi K2.6	Arena Elo: 1462	Moonshot AI	84	94	100%	$3.49	262K
7	Gemma 4 31B	Arena Elo: 1452	Google	84	92	100%	$0.370	262K
8	GLM 5	Arena Elo: 1457	Zhipu AI	84	93	100%	$1.92	203K
9	Grok 4.3	Arena Elo: 1447	xAI	83	91	100%	$2.50	1.0M
10	Claude Opus 4.6 (Fast)	-	Anthropic	83	90	100%	$150.00	1.0M
11	Qwen3.6 Plus	Arena Elo: 1444	Alibaba	83	91	100%	$1.95	1.0M
12	MiMo-V2-Pro	Arena Elo: 1448	Xiaomi	83	91	100%	$3.00	1.0M
13	GPT-5.4 Pro	-	OpenAI	83	92	100%	$180.00	1.1M
14	Gemini 3.1 Pro Preview Custom Tools	-	Google	83	92	100%	$12.00	1.0M
15	Qwen3.5 397B A17B	Arena Elo: 1445	Alibaba	83	91	100%	$2.34	262K
16	GPT-5.2-Codex	-	OpenAI	83	90	100%	$14.00	400K
17	GLM 4.7	Arena Elo: 1443	Zhipu AI	83	91	100%	$1.75	203K
18	GPT-5.2 Pro	-	OpenAI	83	90	100%	$168.00	400K
19	Claude Opus 4.1	Arena Elo: 1449	Anthropic	83	92	100%	$75.00	200K
20	GPT-5.5 Pro	SWE-bench: 88.7%	OpenAI	82	89	100%	$180.00	1.1M
21	GPT-5.5	SWE-bench: 88.7%	OpenAI	82	89	100%	$30.00	1.1M
22	DeepSeek V4 Flash	Arena Elo: 1433	DeepSeek	82	89	100%	$0.200	1.0M
23	MiMo-V2.5	Arena Elo: 1434	Xiaomi	82	89	100%	$0.280	1.0M
24	Gemma 4 26B A4B	Arena Elo: 1439	Google	82	90	100%	$0.330	262K
25	Grok 4.20	-	xAI	82	88	100%	$2.50	2.0M
26	Gemini 3.1 Flash Lite Preview	Arena Elo: 1433	Google	82	89	100%	$1.50	1.0M
27	GPT-5.3-Codex	-	OpenAI	82	88	100%	$14.00	400K
28	GPT-5 Pro	-	OpenAI	82	88	100%	$120.00	400K
29	GPT-5 Codex	-	OpenAI	82	88	100%	$10.00	400K
30	Hy3 preview	Arena Elo: 1416	Tencent	81	86	100%	$0.210	262K
31	Claude Opus 4.7	SWE-bench: 87.6%	Anthropic	81	88	100%	$25.00	1.0M
32	Qwen3.5-122B-A10B	Arena Elo: 1417	Alibaba	81	86	100%	$2.08	262K
33	GPT-5.1-Codex-Max	-	OpenAI	81	87	100%	$10.00	400K
34	GPT-5.1-Codex	-	OpenAI	81	87	100%	$10.00	400K
35	GPT-5.1-Codex-Mini	-	OpenAI	81	87	100%	$2.00	400K
36	o3 Deep Research	-	OpenAI	81	86	100%	$40.00	200K
37	GLM 4.6	Arena Elo: 1426	Zhipu AI	81	88	100%	$1.74	203K
38	o3 Pro	-	OpenAI	81	86	100%	$80.00	200K
39	MiMo-V2-Omni	Arena Elo: 1414	Xiaomi	80	86	100%	$2.00	262K
40	MiniMax M2.7	Arena Elo: 1413	MiniMax	80	86	100%	$1.20	205K
41	Qwen3.5-27B	Arena Elo: 1408	Alibaba	80	85	100%	$1.56	262K
42	Qwen3.5-35B-A3B	Arena Elo: 1396	Alibaba	79	83	100%	$1.00	262K
43	Qwen3.5-Flash	Arena Elo: 1396	Alibaba	79	83	100%	$0.260	1.0M
44	Claude Opus 4.6	SWE-bench: 83.7%	Anthropic	79	84	100%	$25.00	1.0M
45	Step 3.5 Flash	Arena Elo: 1394	StepFun	79	82	100%	$0.300	262K
46	DeepSeek V3.2 Exp	Arena Elo: 1423	DeepSeek	79	87	100%	$0.410	164K
47	DeepSeek V3.1 Terminus	Arena Elo: 1416	DeepSeek	79	86	100%	$0.950	164K
48	DeepSeek V3.1	Arena Elo: 1418	DeepSeek	79	86	100%	$0.790	164K
49	Gemini 2.5 Pro Preview 06-05	-	Google	79	83	100%	$10.00	1.0M
50	Gemini 2.5 Pro Preview 05-06	-	Google	79	83	100%	$10.00	1.0M
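
Because the Output $/M column is a price per million output tokens, the output-side cost of a session scales linearly with the number of tokens a model generates. A minimal sketch of that arithmetic, using three prices from the table above; the helper name `output_cost` and the 200,000-token budget are illustrative assumptions, and input-token pricing is ignored:

```python
# Output prices in USD per million tokens, copied from the table above.
OUTPUT_PRICE_PER_M = {
    "Gemini 3.5 Flash": 9.00,
    "Claude Opus 4.7 (Fast)": 150.00,
    "MiMo-V2.5-Pro": 0.870,
}

def output_cost(model: str, output_tokens: int) -> float:
    """Return the output-side cost in USD for a hypothetical token count."""
    return OUTPUT_PRICE_PER_M[model] * output_tokens / 1_000_000

if __name__ == "__main__":
    # Assumed budget: 200,000 output tokens for one working session.
    for name in OUTPUT_PRICE_PER_M:
        print(f"{name}: ${output_cost(name, 200_000):.2f}")
```

Models with nearly identical scores can differ in output price by two orders of magnitude, so this kind of estimate is worth running alongside the ranking.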


Frequently Asked Questions

Based on our analysis of coding benchmarks, capability matching, and pricing, Gemini 3.5 Flash currently ranks #1 for Claude Code. Rankings are rebuilt as benchmark, pricing, and provider data refresh.

We score models using benchmark performance (90%) from LMArena, HumanEval, SWE-bench, MMLU, and 15+ standardized evaluations. Capabilities and context serve as tiebreakers (10%). Only models with the capabilities Claude Code needs are included in the tool-specific rankings.
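
The capability filter mentioned above could look something like the sketch below. The capability identifiers mirror the "What Matters for Claude Code" list, but the requirement that a model support all five, the helper names, and the two example model records are assumptions for illustration, not data from the rankings.

```python
# Capability names taken from the "What Matters for Claude Code" list;
# the snake_case identifiers are assumed for this sketch.
REQUIRED = {"function_calling", "reasoning", "streaming", "large_context", "strong_coding"}

def capability_match(supported: set[str]) -> float:
    """Fraction of the required capabilities that a model supports."""
    return len(REQUIRED & supported) / len(REQUIRED)

def eligible(models: dict[str, set[str]]) -> list[str]:
    """Names of models that support every required capability (assumed rule)."""
    return sorted(name for name, caps in models.items() if REQUIRED <= caps)

# Hypothetical records, not real catalog entries.
example_models = {
    "model-a": set(REQUIRED),
    "model-b": {"reasoning", "streaming"},
}
```

Under this assumed rule, `eligible(example_models)` keeps only `model-a`, while `capability_match` gives a partial score for models that fall short.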

We currently track 331 AI models compatible with Claude Code. This includes models from OpenAI, Anthropic, Google, DeepSeek, and other providers accessible via API.

Many open-source models are compatible with Claude Code through API providers like OpenRouter, Together AI, and Groq. Check our rankings to see which open-source models perform best.

Rankings refresh whenever the underlying benchmark, pricing, and catalog sources refresh. That means some signals update faster than others, and the page reflects the latest verified source data available.