Head-to-head

FLUX.1.1 [pro] vs GPT-5.5

Normalized scores are min-max scaled per benchmark across all models we track, onto a 0–100 range. Open the interactive compare view to add benchmarks to the radar chart or pull in more models.
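As a rough illustration of per-benchmark min-max scaling, here is a minimal Python sketch. It is not the site's actual code: the function name `min_max_normalize`, the model names, and the raw scores are all hypothetical, and the handling of missing values and ties is an assumption.

```python
from typing import Dict, Optional


def min_max_normalize(
    scores: Dict[str, Optional[float]],
) -> Dict[str, Optional[float]]:
    """Scale one benchmark's raw scores onto 0-100 across all models.

    Models with no score (None) stay None, mirroring a missing table cell.
    """
    present = [v for v in scores.values() if v is not None]
    if not present:
        # No model has a score on this benchmark.
        return {model: None for model in scores}

    lo, hi = min(present), max(present)
    if hi == lo:
        # Assumption: when every scored model ties, map them all to 0.0
        # rather than dividing by zero.
        return {m: (None if v is None else 0.0) for m, v in scores.items()}

    return {
        m: (None if v is None else 100.0 * (v - lo) / (hi - lo))
        for m, v in scores.items()
    }


# Hypothetical raw scores for a single benchmark (illustrative values only).
raw = {"model_a": 1200.0, "model_b": 1300.0, "model_c": 1250.0, "model_d": None}
result = min_max_normalize(raw)
print(result)
```

Because the minimum and maximum are taken over every tracked model, a normalized score is relative to the full pool and not only to the two models being compared. Under this scheme, adding or removing a model from the pool can shift every other model's normalized score on that benchmark.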


FLUX.1.1 [pro] (Black Forest Labs)

GPT-5.5 (OpenAI)

Benchmark	FLUX.1.1 [pro]	GPT-5.5
Chatbot Arena Elo	—
Image Arena Elo		—
Prompt Adherence		—
FrontierMath Tiers 1-3	—
SimpleQA Verified	—
OTIS Mock AIME 2024-2025	—
ARC-AGI 2	—
Terminal-Bench 2	—
Frontier Composite	—
Output Stability	—
Format Adherence	—
Recovery Rate	—
Safety Handling	—

Methodology matches the main AI Model Analyzer About page.