AMA

Head-to-head

Claude Sonnet 4 (Thinking) vs FLUX.1.1 [pro]

Normalized scores are min-maxed per benchmark across all models we track (0–100). Open the interactive compare view to add benchmarks to the radar chart or pull in more models.

Claude Sonnet 4 (Thinking)
Anthropic
FLUX.1.1 [pro]
Black Forest Labs
BenchmarkClaude Sonnet 4 (Thinking)FLUX.1.1 [pro]
LiveCodeBench

LiveCB

Image Arena Elo

Img Arena

Prompt Adherence

Prompt Fid.

Output Speed

Speed

Time to First Token

TTFT

Rolling Contamination-Controlled Average

Rolling Avg

Rolling Data Analysis

Data Analysis

Humanity's Last Exam

HLE

Methodology matches the main AI Model Analyzer About page.