API model pricing and benchmark intelligence

Choose the right API model before you burn tokens.

AILeaderboard.live ranks API language models for developers across cost, speed, coding quality, and practical value using official provider data plus curated benchmark signals.

Current public snapshot updated Mar 10, 2026.

Launch provider set: 8
Models in current snapshot: 11
Largest context in catalog: 1M tokens
Cheapest input rate: $0.28/1M tokens

Top models right now

Switch personas to see how the winners change when price, speed, or coding quality matters most.

Cheapest: Top 3 snapshot

Lowest total token cost only.

Score = normalized total token cost only, where total cost = input price + output price.
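As a rough illustration of the Cheapest score, the sketch below sums input and output prices and min-max scales the totals. The page does not say which normalization it uses, so both the scaling method and the $18 ceiling (the $3 input + $15 output entry that appears in the Smartest and Coding cards) are assumptions. Under those assumptions the function happens to reproduce the three scores shown in this snapshot. The names `cost_score` and `ASSUMED_MAX_TOTAL` are hypothetical.

```python
# Input and output prices in USD per 1M tokens, copied from the catalog table.
PRICES = {
    "DeepSeek Chat": (0.28, 0.42),
    "Grok 3 Mini": (0.30, 0.50),
    "Llama 3.3 70B Versatile": (0.59, 0.79),
    "GPT-4.1 mini": (0.40, 1.60),
    "GPT-4.1": (2.00, 8.00),
}

# ASSUMPTION: the priciest snapshot entry totals $3 + $15 = $18 per 1M tokens.
ASSUMED_MAX_TOTAL = 18.0


def total_cost(name):
    """Total cost as the page defines it: input price + output price."""
    input_price, output_price = PRICES[name]
    return input_price + output_price


def cost_score(total, cheapest, priciest=ASSUMED_MAX_TOTAL):
    """Min-max scale a total so the cheapest model gets 100 and the priciest 0.

    ASSUMPTION: min-max scaling is a guess at the site's normalization.
    """
    return round(100 * (priciest - total) / (priciest - cheapest), 1)


cheapest = min(total_cost(name) for name in PRICES)
ranked = sorted(PRICES, key=total_cost)
top3 = [(name, cost_score(total_cost(name), cheapest)) for name in ranked[:3]]
print(top3)
```

Because the score uses the plain sum of the two prices, it treats input and output tokens as equally weighted, whatever your actual traffic mix is.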

#1 DeepSeek Chat (DeepSeek)
Score 100.0
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.28/1M
Output: $0.42/1M
Context: 128K tokens

#2 Grok 3 Mini (xAI)
Score 99.4
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.30/1M
Output: $0.50/1M
Context: 131.1K tokens

#3 Llama 3.3 70B Versatile (Groq)
Score 96.1
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.59/1M
Output: $0.79/1M
Context: 131.1K tokens

Fastest: Top 3 snapshot

Lowest observed latency only.

Score = normalized latency only.

#1 Llama 3.3 70B Versatile (Groq)
Score 100.0
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.59/1M
Output: $0.79/1M
Context: 131.1K tokens

#2 Gemini 2.5 Flash (Google Gemini)
Score 81.0
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.30/1M
Output: $2.5/1M
Context: 1M tokens

#3 Grok 3 Mini (xAI)
Score 73.5
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.30/1M
Output: $0.50/1M
Context: 131.1K tokens

Smartest: Top 3 snapshot

Highest intelligence benchmark score only.

Score = normalized intelligence benchmark only.

#1 GPT-4.1 (OpenAI)
Score 100.0
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $2/1M
Output: $8/1M
Context: 1M tokens

#2 Gemini 2.5 Pro (Google Gemini)
Score 94.4
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $1.25/1M
Output: $10/1M
Context: 1M tokens

#3
Score 88.9
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $3/1M
Output: $15/1M
Context: 200K tokens

Best Value: Top 3 snapshot

The most balanced tradeoff between cost, quality, and responsiveness.

Score = 45% intelligence + 35% cost + 10% latency + 10% context.
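The weighted blend above can be sketched as a plain weighted sum. This assumes each component has already been normalized to a 0-100 scale, which the page implies but does not spell out. The function name `best_value` and the sample component scores are hypothetical and are not taken from the leaderboard.

```python
# Weights as stated on the page for the Best Value persona.
WEIGHTS = {"intelligence": 0.45, "cost": 0.35, "latency": 0.10, "context": 0.10}


def best_value(components):
    """Weighted sum of component scores, each assumed to be on a 0-100 scale."""
    missing = WEIGHTS.keys() - components.keys()
    if missing:
        raise ValueError(f"missing component scores: {sorted(missing)}")
    return round(sum(WEIGHTS[key] * components[key] for key in WEIGHTS), 1)


# Hypothetical component scores, made up for illustration only.
example = {"intelligence": 80.0, "cost": 60.0, "latency": 50.0, "context": 40.0}
print(best_value(example))
```

With these weights, intelligence and cost together account for 80% of the score, so latency and context size can only shift a ranking at the margins.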

#1 GPT-4.1 (OpenAI)
Score 72.0
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $2/1M
Output: $8/1M
Context: 1M tokens

#2 Gemini 2.5 Pro (Google Gemini)
Score 66.2
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $1.25/1M
Output: $10/1M
Context: 1M tokens

#3 GPT-4.1 mini (OpenAI)
Score 60.8
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $0.40/1M
Output: $1.6/1M
Context: 1M tokens

Coding: Top 3 snapshot

Highest coding benchmark score only.

Score = normalized coding benchmark only.

#1
Score 100.0
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $3/1M
Output: $15/1M
Context: 200K tokens

#2 GPT-4.1 (OpenAI)
Score 87.5
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $2/1M
Output: $8/1M
Context: 1M tokens

#3 Gemini 2.5 Pro (Google Gemini)
Score 70.8
Pricing: Official Live | Benchmarks: Third-Party
Synced within 7d
Input: $1.25/1M
Output: $10/1M
Context: 1M tokens

Model (badge)                      Provider       Input ($/1M)  Output ($/1M)  Context  Value  Coverage
GPT-4.1 (Smartest)                 OpenAI         $2            $8             1M       72.0   full
Gemini 2.5 Pro (Smartest)          Google Gemini  $1.25         $10            1M       66.2   full
GPT-4.1 mini (Cheapest)            OpenAI         $0.40         $1.6           1M       60.8   full
Mistral Large (Cheapest)           Mistral        $0.50         $1.5           128K     59.7   full
DeepSeek Chat (Cheapest)           DeepSeek       $0.28         $0.42          128K     59.3   full
Gemini 2.5 Flash (Cheapest)        Google Gemini  $0.30         $2.5           1M       58.9   full
Grok 3 Mini (Cheapest)             xAI            $0.30         $0.50          131.1K   49.7   full
Llama 3.3 70B Versatile (Fastest)  Groq           $0.59         $0.79          131.1K   48.7   full
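
The listed prices are per 1M tokens, so what a model actually costs you depends on your own mix of input and output tokens. Below is a minimal sketch for estimating spend from the catalog prices. The function name `estimate_cost` and the sample workload are hypothetical.

```python
def estimate_cost(input_tokens, output_tokens, input_price, output_price):
    """Return USD cost given token counts and prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 2M input tokens and 500K output tokens.
workload = (2_000_000, 500_000)

# Prices taken from the GPT-4.1 and DeepSeek Chat rows of the catalog.
gpt_41 = estimate_cost(*workload, input_price=2.00, output_price=8.00)
deepseek_chat = estimate_cost(*workload, input_price=0.28, output_price=0.42)
print(f"GPT-4.1: ${gpt_41:.2f}, DeepSeek Chat: ${deepseek_chat:.2f}")
```

Output-heavy workloads push the estimate toward the output price, which is the higher of the two rates for every model in this catalog.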