LLM API Pricing Comparison
Compare input & output costs, context windows, and real-world pricing for all major AI models โ side by side.
API Pricing Index
69 models ยท Prices per 1 million tokens ยท Click headers to sort
| Model | Input / 1M | Output / 1M | Ratio | Context |
|---|---|---|---|---|
Ministral 3BMistral | $0.040 Cheapest | $0.040 | 1.0ร | 128.0K |
GPT-5 NanoOpenAI | $0.050 | $0.40 | 8.0ร | 200.0K |
Gemini 1.5 FlashGoogle | $0.075 | $0.30 | 4.0ร | 1.0M |
GPT-4.1 NanoOpenAI | $0.10 | $0.40 | 4.0ร | 1.0M |
Gemini 2.5 Flash-LiteGoogle | $0.10 | $0.40 | 4.0ร | 1.0M |
Gemini 2.0 FlashGoogle | $0.10 | $0.40 | 4.0ร | 1.0M |
Mistral Small 3Mistral | $0.10 | $0.30 | 3.0ร | 32.0K |
Ministral 8BMistral | $0.10 | $0.10 | 1.0ร | 128.0K |
Llama 4 ScoutMeta | $0.11 | $0.34 | 3.1ร | 10.0M Max |
GPT-4o MiniOpenAI | $0.15 | $0.60 | 4.0ร | 128.0K |
Mistral NemoMistral | $0.15 | $0.15 | 1.0ร | 128.0K |
Pixtral 12BMistral | $0.15 | $0.15 | 1.0ร | 128.0K |
GPT-5.4 NanoOpenAI | $0.20 | $1.25 | 6.3ร | 272.0K |
Llama 4 MaverickMeta | $0.20 | $0.60 | 3.0ร | 1.0M |
CodestralMistral | $0.20 | $0.60 | 3.0ร | 256.0K |
Sonar SmallPerplexity | $0.20 | $0.20 | 1.0ร | 127.0K |
Grok 4.1 FastxAI | $0.20 | $0.50 | 2.5ร | 2.0M |
Qwen 2.5 72BQwen | $0.23 | $0.40 | 1.7ร | 131.1K |
GPT-5 MiniOpenAI | $0.25 | $2.00 | 8.0ร | 200.0K |
Claude Haiku 3Anthropic | $0.25 | $1.25 | 5.0ร | 200.0K |
Gemini 3.1 Flash-LiteGoogle | $0.25 | $1.50 | 6.0ร | 1.0M |
DeepSeek V3DeepSeek | $0.28 | $0.42 | 1.5ร | 128.0K |
Gemini 2.5 FlashGoogle | $0.30 | $2.50 | 8.3ร | 1.0M |
GPT-4.1 MiniOpenAI | $0.40 | $1.60 | 4.0ร | 1.0M |
GPT-3.5 TurboOpenAI | $0.50 | $1.50 | 3.0ร | 16.4K |
Gemini 3 FlashGoogle | $0.50 | $3.00 | 6.0ร | 2.0M |
Qwen 3.5 PlusQwen | $0.50 | $2.00 | 4.0ร | 1.0M |
DeepSeek R1DeepSeek | $0.55 | $2.19 | 4.0ร | 128.0K |
LLaMA 3.3 70BMeta | $0.59 | $0.79 | 1.3ร | 131.1K |
GPT-5.4 MiniOpenAI | $0.75 | $4.50 | 6.0ร | 272.0K |
Claude Haiku 3.5Anthropic | $0.80 | $4.00 | 5.0ร | 200.0K |
Claude Haiku 4.5Anthropic | $1.00 | $5.00 | 5.0ร | 200.0K |
Sonar LargePerplexity | $1.00 | $1.00 | 1.0ร | 127.0K |
o4-miniOpenAI | $1.10 | $4.40 | 4.0ร | 200.0K |
o3-miniOpenAI | $1.10 | $4.40 | 4.0ร | 200.0K |
o1-miniOpenAI | $1.10 | $4.40 | 4.0ร | 200.0K |
GPT-5.1OpenAI | $1.25 | $10.00 | 8.0ร | 200.0K |
GPT-5OpenAI | $1.25 | $10.00 | 8.0ร | 200.0K |
Gemini 2.5 ProGoogle | $1.25 | $10.00 | 8.0ร | 2.0M |
Gemini 1.5 ProGoogle | $1.25 | $5.00 | 4.0ร | 2.0M |
Grok 4.3xAI | $1.25 | $2.50 | 2.0ร | 1.0M |
Grok 4.20xAI | $1.25 | $2.50 | 2.0ร | 2.0M |
GPT-5.2OpenAI | $1.75 | $14.00 | 8.0ร | 200.0K |
GPT-4.1OpenAI | $2.00 | $8.00 | 4.0ร | 1.0M |
o3OpenAI | $2.00 | $8.00 | 4.0ร | 200.0K |
Gemini 3.1 ProGoogle | $2.00 | $12.00 | 6.0ร | 2.0M |
Mistral Large 3Mistral | $2.00 | $6.00 | 3.0ร | 128.0K |
Pixtral LargeMistral | $2.00 | $6.00 | 3.0ร | 128.0K |
GPT-5.4OpenAI | $2.50 | $15.00 | 6.0ร | 272.0K |
GPT-4oOpenAI | $2.50 | $10.00 | 4.0ร | 128.0K |
Qwen 3.7 MaxQwen | $2.50 | $7.50 | 3.0ร | 1.0M |
Claude Sonnet 4.6Anthropic | $3.00 | $15.00 | 5.0ร | 1.0M |
Claude Sonnet 4.5Anthropic | $3.00 | $15.00 | 5.0ร | 200.0K |
Claude Sonnet 4Anthropic | $3.00 | $15.00 | 5.0ร | 200.0K |
Claude Sonnet 3.7Anthropic | $3.00 | $15.00 | 5.0ร | 200.0K |
Sonar ProPerplexity | $3.00 | $15.00 | 5.0ร | 200.0K |
Claude Opus 4.7Anthropic | $5.00 | $25.00 | 5.0ร | 1.0M |
Claude Opus 4.6Anthropic | $5.00 | $25.00 | 5.0ร | 1.0M |
Claude Opus 4.5Anthropic | $5.00 | $25.00 | 5.0ร | 200.0K |
Sonar HugePerplexity | $5.00 | $5.00 | 1.0ร | 127.0K |
GPT-5 ProOpenAI | $15.00 | $120.00 | 8.0ร | 200.0K |
o1OpenAI | $15.00 | $60.00 | 4.0ร | 200.0K |
Claude Opus 4.1Anthropic | $15.00 | $75.00 | 5.0ร | 200.0K |
Claude Opus 4Anthropic | $15.00 | $75.00 | 5.0ร | 200.0K |
Claude Opus 3Anthropic | $15.00 | $75.00 | 5.0ร | 200.0K |
o3-proOpenAI | $20.00 | $80.00 | 4.0ร | 200.0K |
GPT-5.2 ProOpenAI | $21.00 | $168.00 | 8.0ร | 200.0K |
GPT-5.4 ProOpenAI | $30.00 | $180.00 | 6.0ร | 272.0K |
o1-proOpenAI | $150.00 | $600.00 | 4.0ร | 200.0K |
Real Cost Comparison by Use Case
See exactly what each model costs for common workloads โ from a single chat to filling an entire context window.
Key Pricing Takeaways for 2026
Cheapest Overall
At $0.040 per 1M input tokens, Ministral 3B is the most affordable major LLM API. Ideal for high-volume, simple tasks where cost efficiency is paramount.
Best Value Mid-Tier
DeepSeek V3 offers GPT-4-class reasoning at $0.27/$1.10 per 1M tokens โ approximately 9ร cheaper than GPT-4o. Best choice for developers who need strong performance on a budget.
Largest Context Window
Llama 4 Scout's 10.0M token context window dwarfs all competitors. For processing entire codebases or very long documents, it's unmatched in capacity.
Best for Quality-Critical
Despite premium pricing at $5.00/$25.00 per 1M tokens, Claude Opus 4.7 remains the go-to for tasks where output quality matters most โ complex reasoning, nuanced writing, and multi-step analysis.
Ready to calculate your actual costs?
Paste your prompt into our free token calculator and see exact token counts, cost breakdowns, and monthly projections for any model.
Open Token Calculator