Throughput
Fastest text models (hosted APIs)
Sorted by our normalized output-speed benchmark (higher is faster relative to models we track). Latency and TTFT differ — use the interactive leaderboard’s realtime-chat scenario for a blended view.
- 1100.0o1-miniOpenAI
- 2100.0Claude 3.5 SonnetAnthropic
- 3100.0Claude 3.5 HaikuAnthropic
- 4100.0Claude 3 OpusAnthropic
- 5100.0Claude 3.7 SonnetAnthropic
- 6100.0Gemini 1.5 ProGoogle
- 7100.0Gemini 1.5 FlashGoogle
- 8100.0Gemini 2.0 FlashGoogle
- 9100.0Grok 2xAI
- 10100.0DeepSeek V3DeepSeek
- 11100.0DeepSeek R1DeepSeek
- 1286.8Gemini 2.5 FlashGoogle
- 1357.9o4-miniOpenAI
- 1457.9o3-miniOpenAI
- 1555.3GPT-4oOpenAI
- 1647.4Llama 4 MaverickMeta
- 1744.7Gemini 2.5 ProGoogle
- 1844.7Llama 4 ScoutMeta
- 1936.8o1OpenAI
- 2031.6GPT-5OpenAI
- 2131.6Llama 3.3 70B InstructMeta
- 2223.7o3OpenAI
- 2318.4GPT-4o miniOpenAI
- 2418.4Qwen3 235BAlibaba (Qwen)
- 2513.2Grok 3xAI
- 2613.2Qwen2.5 72B InstructAlibaba (Qwen)
- 2710.5Claude Sonnet 4 (Thinking)Anthropic
- 287.9Claude Sonnet 4Anthropic
- 297.9Grok 4xAI
- 305.3Mistral Large 2Mistral
- 312.6GPT-4 TurboOpenAI
- 322.6Claude Opus 4Anthropic
- 332.6Claude Opus 4 (Thinking)Anthropic
- 342.6Llama 3.1 70B InstructMeta
- 350.0Llama 3.1 405B InstructMeta
Speed figures are aggregated from observable public sources; see AI Model Analyzer About for methodology.