Coming soon for builders choosing AI models under pressure

Pick the right model
before the spend
gets expensive.

AILeaderboard.live is pivoting into a sharper launch story: a trusted home for official model pricing, decision-ready rankings, and compare workflows that help teams ship faster with fewer pricing surprises.

8 providers mapped in the prototype

11 models already cataloged

11 official live pricing feeds in flight

Anthropic · Cohere · DeepSeek · Google Gemini · Groq · Mistral · OpenAI · xAI

Launch vision

A better model decision surface

Planned
Rankings snapshot: Best value

Claude Sonnet

Official pricing + benchmark context

Score 92.4

Gemini Pro

Wide context + strong value band

Score 90.1

GPT-4.1 mini

Cost-efficient coding fallback

Score 88.7

Pricing provenance

Official first

Provider-owned pricing pages are the source of truth whenever feasible.

Compare flow

Cost · Latency · Context

The goal is less tab juggling and more fast shortlist decisions.

Why this needs to exist

Teams are still choosing AI models with fragmented, stale, and context-light data.

01

Provider pricing pages change before comparison sites catch up.

02

Benchmarks tell you who is smart, not who is worth shipping at scale.

03

Teams still juggle docs, spreadsheets, and screenshots to compare models.

Product vision

A developer-facing decision system for model pricing, quality, and tradeoffs.

The intent is not to publish more noise. It is to make model choice legible: what it costs, why it ranks where it does, and what you gain by switching.

Official pricing first
One place to trust what a token really costs.
Every model card starts from provider-owned pricing and docs, then layers benchmark context on top.
Decision-ready rankings
Sort by cheapest, fastest, smartest, or best value.
The goal was never another generic leaderboard. It was a way to rank models around the tradeoff you actually care about.
Practical compare flows
Line up cost, context, latency, and quality without tab chaos.
A compare surface turns shortlist debates into a fast product decision instead of a research chore.
Preview

What the product was designed to make obvious in seconds.

These are the workflows we are pitching; none of them are promised as fully launched today.

Snapshot
Signal-rich rankings
Persona views for cheapest, fastest, smartest, and coding.
Official pricing provenance attached to each row.
Coverage badges so teams know what is live vs curated.
Sources
Model cards with receipts
Per-model pricing, limits, benchmark, and freshness details.
Source links attached at the field level instead of hidden in footnotes.
Cost calculator for translating token prices into spend.
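The spend math behind that calculator is simple enough to sketch. The function below is an illustrative example only; the prices and token volumes are hypothetical placeholders, not official provider rates.

```python
# Illustrative sketch of the math a cost calculator automates.
# All prices and volumes below are hypothetical placeholders,
# not official provider rates.

def monthly_spend(input_price_per_m: float, output_price_per_m: float,
                  input_tokens: int, output_tokens: int) -> float:
    """Translate per-million-token prices into total spend."""
    return ((input_tokens / 1_000_000) * input_price_per_m
            + (output_tokens / 1_000_000) * output_price_per_m)

# e.g. 50M input tokens and 10M output tokens per month,
# at a placeholder $3.00 / 1M input and $15.00 / 1M output:
spend = monthly_spend(3.00, 15.00, 50_000_000, 10_000_000)
print(f"${spend:,.2f}")  # → $300.00
```

The point of putting this next to official pricing is that the same token volumes can swing total spend dramatically between models, which is exactly the comparison a spreadsheet detour usually hides.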
Workflow
Compare before you commit
Shortlist up to four models side by side.
Spot the cheapest input rate or widest context instantly.
Move from discovery to procurement without a spreadsheet detour.
Who this is for

Built for teams that treat model choice as a product and budget decision.

The prototype already maps 8 providers and 11 models. The launch version needs to make that data useful for the people who actually have to pick a model.

For founders
Stop buying model spend with incomplete context.
See cost, quality, and coverage tradeoffs quickly enough to make budget decisions without drowning in benchmark tabs.
For product teams
Shortlist the right model for the job, not the hype cycle.
Compare models by the workflow that matters now: speed, value, coding performance, or raw intelligence.
For platform engineers
Keep the sourcing layer auditable.
Pricing provenance, limits, and benchmark references are meant to stay visible instead of being lost in hand-maintained spreadsheets.
Trust model

Built around provider-owned pricing, not scraped guesswork.

The credibility angle is simple: pricing comes from official provider-owned pages whenever possible, benchmark context stays explicit, and coverage gaps stay visible.

Official pricing provenance as the default trust layer.
Benchmark quality signals separated from pricing truth.
Coverage badges that show what is live, curated, or still incomplete.
Compare flows that move from research to decision without spreadsheet glue.
Launch plan

A tighter public roadmap than “full product coming someday.”

The site is now explicit about where the product is: polished pitch first, early access next, broader live coverage after that.

01

Tighten the landing story

Lead with the core promise, show the intended product shape, and collect real waitlist demand.

02

Ship early access workflows

Open the ranking, compare, and citation-backed pricing views to the first cohort of users.

03

Expand live coverage

Increase official pricing and limits coverage while keeping benchmark context explicit and separate.

Early access

Join the waitlist for first access when the real product ships.

This signup is for early access and launch updates, not a newsletter treadmill. We will only use it to announce product progress and the initial release.

First access to the launch build
Product progress notes when there is something real to share
A cleaner pricing and compare workflow than the current market offers
Get first access.
Drop your email to hear when the product opens up.

No spam. No referral gimmicks. Just launch updates and early access.

FAQ

A cleaner pitch, with less pretending.

Is the full product live today?
Not yet. The current site is a prototype plus a pitch for the product we wanted to finish, and the waitlist is for early access when it is ready.
What makes this different from a benchmark-only leaderboard?
The product vision puts official pricing provenance, cost awareness, and compare workflows at the center instead of treating them like side data.
What will waitlist subscribers get?
First access to the product, progress updates, and launch notes when the pricing and ranking workflows are ready.