Data · March 31, 2026 · 4 min read

LLM Pricing Index — March 2026

Welcome to the March 2026 LLM Pricing Index. This month, Anthropic solidified its position with Claude Sonnet 4.6, DeepSeek maintained its aggressive budget pricing, and OpenAI held steady with GPT-4o. Track all costs here.

The Price Per Million Tokens

Prices are listed in USD per 1 million tokens, sorted from lowest to highest input cost.

| Model | Provider | Input / 1M | Output / 1M |
| --- | --- | --- | --- |
| Ministral 3B | Mistral | $0.04 | $0.04 |
| GPT-5 Nano | OpenAI | $0.05 | $0.40 |
| Gemini 1.5 Flash | Google | $0.075 | $0.30 |
| GPT-4.1 Nano | OpenAI | $0.10 | $0.40 |
| Gemini 2.5 Flash-Lite | Google | $0.10 | $0.40 |
| Gemini 2.0 Flash | Google | $0.10 | $0.40 |
| Mistral Small 3 | Mistral | $0.10 | $0.30 |
| Ministral 8B | Mistral | $0.10 | $0.10 |
| Llama 4 Scout | Meta | $0.11 | $0.34 |
| GPT-4o Mini | OpenAI | $0.15 | $0.60 |
| Mistral Nemo | Mistral | $0.15 | $0.15 |
| Pixtral 12B | Mistral | $0.15 | $0.15 |
| GPT-5.4 Nano | OpenAI | $0.20 | $1.25 |
| Llama 4 Maverick | Meta | $0.20 | $0.60 |
| Codestral | Mistral | $0.20 | $0.60 |
| Sonar Small | Perplexity | $0.20 | $0.20 |
| Grok 4.1 Fast | xAI | $0.20 | $0.50 |
| Qwen 2.5 72B | Qwen | $0.23 | $0.40 |
| GPT-5 Mini | OpenAI | $0.25 | $2.00 |
| Claude Haiku 3 | Anthropic | $0.25 | $1.25 |
| Gemini 3.1 Flash-Lite | Google | $0.25 | $1.50 |
| DeepSeek V3 | DeepSeek | $0.28 | $0.42 |
| Gemini 2.5 Flash | Google | $0.30 | $2.50 |
| GPT-4.1 Mini | OpenAI | $0.40 | $1.60 |
| GPT-3.5 Turbo | OpenAI | $0.50 | $1.50 |
| Gemini 3 Flash | Google | $0.50 | $3.00 |
| Qwen 3.5 Plus | Qwen | $0.50 | $2.00 |
| DeepSeek R1 | DeepSeek | $0.55 | $2.19 |
| Llama 3.3 70B | Meta | $0.59 | $0.79 |
| GPT-5.4 Mini | OpenAI | $0.75 | $4.50 |
| Claude Haiku 3.5 | Anthropic | $0.80 | $4.00 |
| Claude Haiku 4.5 | Anthropic | $1.00 | $5.00 |
| Sonar Large | Perplexity | $1.00 | $1.00 |
| o4-mini | OpenAI | $1.10 | $4.40 |
| o3-mini | OpenAI | $1.10 | $4.40 |
| o1-mini | OpenAI | $1.10 | $4.40 |
| GPT-5.1 | OpenAI | $1.25 | $10.00 |
| GPT-5 | OpenAI | $1.25 | $10.00 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 |
| Gemini 1.5 Pro | Google | $1.25 | $5.00 |
| Grok 4.3 | xAI | $1.25 | $2.50 |
| Grok 4.20 | xAI | $1.25 | $2.50 |
| GPT-5.2 | OpenAI | $1.75 | $14.00 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 |
| o3 | OpenAI | $2.00 | $8.00 |
| Gemini 3.1 Pro | Google | $2.00 | $12.00 |
| Mistral Large 3 | Mistral | $2.00 | $6.00 |
| Pixtral Large | Mistral | $2.00 | $6.00 |
| GPT-5.4 | OpenAI | $2.50 | $15.00 |
| GPT-4o | OpenAI | $2.50 | $10.00 |
| Qwen 3.7 Max | Qwen | $2.50 | $7.50 |
| Claude Sonnet 4.6 | Anthropic | $3.00 | $15.00 |
| Claude Sonnet 4.5 | Anthropic | $3.00 | $15.00 |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 |
| Claude Sonnet 3.7 | Anthropic | $3.00 | $15.00 |
| Sonar Pro | Perplexity | $3.00 | $15.00 |
| Claude Opus 4.7 | Anthropic | $5.00 | $25.00 |
| Claude Opus 4.6 | Anthropic | $5.00 | $25.00 |
| Claude Opus 4.5 | Anthropic | $5.00 | $25.00 |
| Sonar Huge | Perplexity | $5.00 | $5.00 |
| GPT-5 Pro | OpenAI | $15.00 | $120.00 |
| o1 | OpenAI | $15.00 | $60.00 |
| Claude Opus 4.1 | Anthropic | $15.00 | $75.00 |
| Claude Opus 4 | Anthropic | $15.00 | $75.00 |
| Claude Opus 3 | Anthropic | $15.00 | $75.00 |
| o3-pro | OpenAI | $20.00 | $80.00 |
| GPT-5.2 Pro | OpenAI | $21.00 | $168.00 |
| GPT-5.4 Pro | OpenAI | $30.00 | $180.00 |
| o1-pro | OpenAI | $150.00 | $600.00 |
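
To turn these per-million rates into a per-request figure, scale each rate by the number of tokens actually sent and received. The following is a minimal sketch, not an official calculator: the `request_cost` helper and the `PRICES` dictionary are hypothetical names, the prices are a hand-copied subset of the table above, and the token counts in the usage loop are illustrative rather than measured.

```python
# Input and output prices in USD per 1M tokens, copied from the table (subset).
PRICES = {
    "GPT-4o Mini": (0.15, 0.60),
    "DeepSeek V3": (0.28, 0.42),
    "GPT-4o": (2.50, 10.00),
    "Claude Sonnet 4.6": (3.00, 15.00),
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed per-1M-token rates."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Illustrative request: 2,000 input tokens and 500 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 2_000, 500):.6f}")
```

Keeping input and output rates separate matters because output tokens cost several times more than input tokens for most models in the table.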

Key Insights for March 2026

1. The Gap Between Flagship and "Mini" Models Has Widened

GPT-4o Mini ($0.15 input, $0.60 output) is now a staggering 94% cheaper than GPT-4o ($2.50 input, $10.00 output) on both input and output tokens. For 80% of pipeline tasks (classification, basic extraction), developers are abandoning flagship models entirely.
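
The 94% figure follows directly from the table's rates. As a quick check, here is a small sketch; the `percent_cheaper` helper is a hypothetical name introduced only for illustration.

```python
def percent_cheaper(cheap_price: float, expensive_price: float) -> float:
    """Percentage saved by paying cheap_price instead of expensive_price."""
    return (1 - cheap_price / expensive_price) * 100


# Rates in USD per 1M tokens, taken from the table.
mini_input, mini_output = 0.15, 0.60          # GPT-4o Mini
flagship_input, flagship_output = 2.50, 10.00  # GPT-4o

input_saving = percent_cheaper(mini_input, flagship_input)
output_saving = percent_cheaper(mini_output, flagship_output)

print(f"Input:  {input_saving:.0f}% cheaper")
print(f"Output: {output_saving:.0f}% cheaper")
```

The same helper can be pointed at any other flagship and mini pair in the table to see how the gap compares across providers.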

2. DeepSeek Remains the Price/Performance King

At just $0.28 per 1M input tokens, DeepSeek V3 costs about 9% of Claude Sonnet's input price ($3.00) and about 11% of GPT-4o's ($2.50), while often matching their performance on coding and translation tasks.
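
The relative cost can be worked out from the table's figures, which list DeepSeek V3 at $0.28 input and $0.42 output. The sketch below uses those table values; `price_ratio_percent` is a hypothetical helper name, and Claude Sonnet 4.6 is chosen here as the representative Sonnet model.

```python
# Input and output rates in USD per 1M tokens, taken from the table.
DEEPSEEK_V3 = (0.28, 0.42)
COMPETITORS = {
    "Claude Sonnet 4.6": (3.00, 15.00),
    "GPT-4o": (2.50, 10.00),
}


def price_ratio_percent(price: float, reference: float) -> float:
    """Express price as a percentage of reference."""
    return price / reference * 100


# For each competitor: (input ratio, output ratio) of DeepSeek V3's price.
ratios = {
    name: (
        price_ratio_percent(DEEPSEEK_V3[0], rates[0]),
        price_ratio_percent(DEEPSEEK_V3[1], rates[1]),
    )
    for name, rates in COMPETITORS.items()
}

for name, (input_pct, output_pct) in ratios.items():
    print(f"DeepSeek V3 vs {name}: {input_pct:.1f}% of input, {output_pct:.1f}% of output")
```

On these numbers the gap is wider on output tokens than on input tokens, so output-heavy workloads see the larger relative saving.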

3. Context Windows Determine RAG Costs

At the $1.25 per 1M input rate listed above, filling Gemini 1.5 Pro's 2M context window costs $2.50 per request in input tokens alone. While powerful, doing this across 1,000 queries comes to $2,500, which far outweighs the cost of setting up a proper vector database for RAG.
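
The arithmetic behind this comparison can be sketched as follows. The 2M-token window and the $1.25 rate come from the text and table; the 4,000-token RAG prompt size is an assumption chosen purely for illustration, and the estimate ignores output tokens, embedding costs, and vector database hosting.

```python
GEMINI_15_PRO_INPUT = 1.25       # USD per 1M input tokens, from the table
FULL_CONTEXT_TOKENS = 2_000_000  # the full 2M context window
RAG_PROMPT_TOKENS = 4_000        # ASSUMPTION: hypothetical retrieved-context size
QUERIES = 1_000


def input_cost(tokens: int, price_per_million: float) -> float:
    """USD cost of sending `tokens` input tokens at the given per-1M rate."""
    return tokens * price_per_million / 1_000_000


full_context_total = input_cost(FULL_CONTEXT_TOKENS, GEMINI_15_PRO_INPUT) * QUERIES
rag_total = input_cost(RAG_PROMPT_TOKENS, GEMINI_15_PRO_INPUT) * QUERIES

print(f"Full context, {QUERIES} queries: ${full_context_total:,.2f}")
print(f"RAG prompt,   {QUERIES} queries: ${rag_total:,.2f}")
```

Under these assumptions the input-token bill differs by a factor of 500, which leaves considerable room for retrieval infrastructure before the two approaches cost the same.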

📚 Calculate Your Costs: