LLM Foundation Models

Compare performance, pricing, and features of leading Large Language Models

| Model | Creator | Context Window | Quality Index | Price (USD/1M tokens) | Tokens/s | Latency (s) |
|---|---|---|---|---|---|---|
| o1-preview | OpenAI | 128k | 86 | $27.56 | 147.5 | 22.89 |
| o1-mini | OpenAI | 128k | 84 | $5.25 | 225.0 | 10.21 |
| Gemini 2.0 Flash (exp) | Google | 2000k | 82 | $0.00 | 168.7 | 0.53 |
| DeepSeek V3 | DeepSeek | 128k | 80 | $0.48 | 89.0 | 1.00 |
| Gemini 1.5 Pro (Sep) | Google | 2000k | 80 | $2.19 | 60.1 | 0.82 |
| Claude 3.5 Sonnet (Oct) | Anthropic | 200k | 80 | $6.00 | 67.2 | 1.02 |
| GPT-4o (May '24) | OpenAI | 128k | 78 | $7.50 | 102.7 | 0.63 |
| GPT-4o (Aug '24) | OpenAI | 128k | 78 | $4.38 | 90.5 | 0.63 |
| Qwen2.5 72B | Alibaba | 131k | 77 | $0.40 | 67.0 | 0.58 |
| Claude 3.5 Sonnet (June) | Anthropic | 200k | 76 | $6.00 | 58.0 | 0.93 |
| Nova Pro | Amazon | 300k | 75 | $1.40 | 93.1 | 0.39 |
| GPT-4 Turbo | OpenAI | 128k | 75 | $15.00 | 38.9 | 1.21 |
| Mistral Large 2 (Jul '24) | Mistral | 128k | 74 | $3.00 | 33.4 | 0.50 |
| Pixtral Large | Mistral | 128k | 74 | $3.00 | 37.8 | 0.39 |
| Llama 3.1 405B | Meta | 128k | 74 | $3.50 | 29.8 | 0.73 |
| Llama 3.3 70B | Meta | 128k | 74 | $0.67 | 72.8 | 0.48 |
| GPT-4o (Nov '24) | OpenAI | 128k | 73 | $4.38 | 119.9 | 0.33 |
| GPT-4o mini | OpenAI | 128k | 73 | $0.26 | 113.4 | 0.62 |
| Gemini 1.5 Flash (Sep) | Google | 1000k | 72 | $0.13 | 186.6 | 0.41 |
| Claude 3 Opus | Anthropic | 200k | 70 | $30.00 | 25.7 | 2.01 |
| Llama 3.2 90B (Vision) | Meta | 128k | 68 | $0.81 | 47.6 | 0.34 |
| Llama 3.1 70B | Meta | 128k | 68 | $0.72 | 72.7 | 0.46 |
| Claude 3.5 Haiku | Anthropic | 200k | 68 | $1.60 | 64.5 | 0.72 |
| Yi-Large | 01.AI | 32k | 61 | $3.00 | 66.5 | 0.44 |
| Gemma 2 27B | Google | 8k | 61 | $0.26 | 48.1 | 0.75 |
| Claude 3 Sonnet | Anthropic | 200k | 57 | $6.00 | 66.4 | 0.74 |
| Command-R+ | Cohere | 128k | 55 | $5.19 | 50.2 | 0.48 |
| Gemma 2 9B | Google | 8k | 55 | $0.12 | 170.0 | 0.41 |
| Claude 3 Haiku | Anthropic | 200k | 55 | $0.50 | 123.1 | 0.55 |
| Llama 3.1 8B | Meta | 128k | 54 | $0.10 | 182.9 | 0.35 |
| Llama 3.2 11B (Vision) | Meta | 128k | 54 | $0.18 | 132.1 | 0.30 |
| Llama 3.2 3B | Meta | 128k | 49 | $0.06 | 249.6 | 0.38 |
| Llama 3 70B | Meta | 8k | 47 | $0.89 | 49.9 | 0.40 |
| Llama 3 8B | Meta | 8k | 45 | $0.15 | 122.9 | 0.34 |
| Llama 3.2 1B | Meta | 128k | 26 | $0.04 | 468.4 | 0.38 |
| GPT-4 | OpenAI | 8k | - | $37.50 | 29.0 | 0.66 |
| Llama 2 Chat 7B | Meta | 4k | - | $0.33 | 123.9 | 0.38 |
| Gemini 1.0 Pro | Google | 33k | - | $0.75 | 102.8 | 1.29 |

Methodology

While higher quality models are typically more expensive, they do not all follow the same price-quality curve.
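One way to see that models do not share a single price-quality curve is to check which ones are "dominated", meaning another model offers at least the same quality at no higher price. The sketch below is only an illustration: it uses a handful of (Quality Index, price) pairs taken from the table above, and the function name `pareto_frontier` and the choice of rows are assumptions made for this example, not part of the site's methodology.

```python
# (model, Quality Index, blended price in USD per 1M tokens), copied from the table.
models = [
    ("o1-preview", 86, 27.56),
    ("o1-mini", 84, 5.25),
    ("DeepSeek V3", 80, 0.48),
    ("Claude 3.5 Sonnet (Oct)", 80, 6.00),
    ("Qwen2.5 72B", 77, 0.40),
    ("GPT-4 Turbo", 75, 15.00),
    ("Claude 3 Opus", 70, 30.00),
    ("Llama 3.1 8B", 54, 0.10),
]


def pareto_frontier(rows: list[tuple[str, int, float]]) -> list[str]:
    """Return the models that no other model beats on both quality and price."""
    frontier = []
    for name, quality, price in rows:
        # A row is dominated if some other row is at least as good on both
        # axes and strictly better on at least one of them.
        dominated = any(
            q >= quality and p <= price and (q > quality or p < price)
            for _, q, p in rows
        )
        if not dominated:
            frontier.append(name)
    return frontier


print(pareto_frontier(models))
```

In this subset, GPT-4 Turbo and Claude 3 Opus cost more than o1-mini while scoring lower, so they fall off the frontier. That is the sense in which price alone does not predict quality.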

Quality Index

Average result across our evaluations covering different dimensions of model intelligence. Currently includes MMLU, GPQA, Math & HumanEval. OpenAI o1 model figures are preliminary.
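As a rough illustration of how such an index could be computed, the sketch below takes an unweighted mean of four benchmark scores. The equal weighting is an assumption (the text only says "average"), and the scores are made-up placeholder values, not results for any listed model.

```python
from statistics import mean


def quality_index(scores: dict[str, float]) -> float:
    """Unweighted mean of per-benchmark scores (equal weighting is assumed)."""
    return mean(scores.values())


# Hypothetical scores on a 0-100 scale, for illustration only.
example_scores = {"MMLU": 88.0, "GPQA": 59.0, "Math": 78.0, "HumanEval": 93.0}

print(f"{quality_index(example_scores):.1f}")  # prints 79.5
```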

Price

Price per token, expressed in USD per million tokens. The price is a blend of input and output token prices (3:1 ratio).
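The blend can be sketched as a weighted average. This example assumes the 3:1 ratio means three parts input price to one part output price. The function name `blended_price` and the example prices are hypothetical and chosen only to show the arithmetic.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,
    output_weight: float = 1.0,
) -> float:
    """Weighted average of input and output prices, in USD per 1M tokens.

    The default weights assume a 3:1 input-to-output ratio.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Hypothetical prices: $3.00 per 1M input tokens, $15.00 per 1M output tokens.
price = blended_price(3.00, 15.00)
print(f"${price:.2f}")  # prints $6.00
```

Because input tokens carry three times the weight, a model with expensive output but cheap input ends up with a blended price much closer to its input price.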

Median across providers

Figures are the median (P50) across all providers that support the model.
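A minimal sketch of that aggregation, using Python's standard `statistics.median`. The provider names and throughput numbers are placeholders invented for this example, and the helper `median_across_providers` is hypothetical.

```python
from statistics import median


def median_across_providers(measurements: dict[str, float]) -> float:
    """Return the median (P50) of one metric measured on several providers."""
    return median(measurements.values())


# Hypothetical tokens/s for one model as served by three providers.
tokens_per_second = {
    "provider_a": 61.2,
    "provider_b": 72.8,
    "provider_c": 95.4,
}

print(median_across_providers(tokens_per_second))  # prints 72.8
```

Unlike a mean, the median is not pulled up or down by one unusually fast or slow provider, which is presumably why it suits a summary figure of this kind.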