Value ranking
Best value on Time to First Token
Median time, in milliseconds, from request to first output chunk on the model's first-party API for medium-length prompts. Lower is snappier; reasoning models are penalised here because they think before they talk.
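To make the metric concrete, below is a minimal sketch of how time to first token can be measured against a streaming chat API. It uses the OpenAI Python SDK purely as an example; the model name, prompt, and run count are illustrative and not the leaderboard's actual harness, which this page does not publish.

```python
import time

from openai import OpenAI  # example SDK; any streaming completion API works the same way

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def measure_ttft_ms(model: str, prompt: str) -> float:
    """Milliseconds from sending the request to receiving the first output chunk."""
    start = time.perf_counter()
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # streaming exposes the first chunk as a separate event
    )
    for _ in stream:  # stop at the very first chunk
        return (time.perf_counter() - start) * 1000.0
    raise RuntimeError("stream produced no chunks")


# Take the median over repeated runs, as the description above specifies.
runs = sorted(measure_ttft_ms("gpt-4o-mini", "Summarize the CAP theorem.") for _ in range(9))
print(f"median TTFT: {runs[len(runs) // 2]:.0f} ms")
```

A serious harness would also pin the client region and warm up the connection first, since DNS and TLS setup on a cold route can swamp the differences between the fastest models.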
“Value” is the normalized benchmark score for this axis (0–100 within this leaderboard cohort; here, Time to First Token) divided by input price per million tokens. Higher means more capability per dollar on this axis only; always sanity-check latency, context length, and your real workload. A short sketch below the table reproduces this arithmetic.
| Rank | Model | Vendor | Value | Score (0–100) | Input price |
|---:|---|---|---:|---:|---:|
| 1 | Gemini 1.5 Flash | Google | 1332.67 | 100.0 | $0.08/M |
| 2 | Gemini 2.0 Flash | Google | 999.50 | 100.0 | $0.10/M |
| 3 | GPT-4o mini | OpenAI | 666.67 | 100.0 | $0.15/M |
| 4 | Llama 4 Scout | Meta | 555.06 | 99.9 | $0.18/M |
| 5 | Qwen3 235B | Alibaba (Qwen) | 495.25 | 99.0 | $0.20/M |
| 6 | DeepSeek V3 | DeepSeek | 370.19 | 100.0 | $0.27/M |
| 7 | Llama 4 Maverick | Meta | 369.37 | 99.7 | $0.27/M |
| 8 | Gemini 2.5 Flash | Google | 333.13 | 99.9 | $0.30/M |
| 9 | DeepSeek R1 | DeepSeek | 181.73 | 100.0 | $0.55/M |
| 10 | Claude 3.5 Haiku | Anthropic | 124.94 | 100.0 | $0.80/M |
| 11 | Llama 3.1 70B Instruct | Meta | 113.48 | 99.9 | $0.88/M |
| 12 | Llama 3.3 70B Instruct | Meta | 113.41 | 99.8 | $0.88/M |
| 13 | Qwen2.5 72B Instruct | Alibaba (Qwen) | 109.89 | 98.9 | $0.90/M |
| 14 | o3-mini | OpenAI | 81.85 | 90.0 | $1.10/M |
| 15 | Gemini 1.5 Pro | Google | 79.96 | 100.0 | $1.25/M |
| 16 | o4-mini | OpenAI | 66.95 | 73.7 | $1.10/M |
| 17 | Gemini 2.5 Pro | Google | 54.11 | 67.6 | $1.25/M |
| 18 | Grok 2 | xAI | 49.98 | 100.0 | $2.00/M |
| 19 | Mistral Large 2 | Mistral | 49.94 | 99.9 | $2.00/M |
| 20 | GPT-4o | OpenAI | 39.95 | 99.9 | $2.50/M |
| 21 | Grok 3 | xAI | 33.32 | 100.0 | $3.00/M |
| 22 | o1-mini | OpenAI | 33.32 | 100.0 | $3.00/M |
| 23 | Claude 3.5 Sonnet | Anthropic | 33.32 | 100.0 | $3.00/M |
| 24 | Claude 3.7 Sonnet | Anthropic | 33.32 | 100.0 | $3.00/M |
| 25 | Claude Sonnet 4 | Anthropic | 33.09 | 99.3 | $3.00/M |
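As a sanity check on the Value formula, the sketch below recomputes score divided by input price for a few rows. The figures are copied from the table above; recomputed values track the listed ones closely but not always exactly, which suggests the displayed scores and prices are rounded from finer-grained figures used in the actual ranking.

```python
# Value = normalized score (0-100) / input price (USD per million input tokens).
# Rows copied from the table above; the listed Value is shown for comparison.
rows = [
    ("GPT-4o mini",      100.0, 0.15,  666.67),
    ("o4-mini",           73.7, 1.10,   66.95),
    ("Gemini 2.5 Pro",    67.6, 1.25,   54.11),
    ("Gemini 1.5 Flash", 100.0, 0.08, 1332.67),  # price likely rounded for display
]

for model, score, price, listed in rows:
    recomputed = score / price
    print(f"{model:18s} recomputed {recomputed:8.2f}  listed {listed:8.2f}")
```

Either way, the rounding is small enough that it does not change the ordering shown in the table.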
AI Model Analyzer does not recommend specific vendors; rankings are derived from public data only.