AMA

Head-to-head

FLUX.1.1 [pro] vs GPT-5.5

Normalized scores are min-maxed per benchmark across all models we track (0–100). Open the interactive compare view to add benchmarks to the radar chart or pull in more models.

FLUX.1.1 [pro]
Black Forest Labs
GPT-5.5
OpenAI
BenchmarkFLUX.1.1 [pro]GPT-5.5
Chatbot Arena Elo

Arena

Image Arena Elo

Img Arena

Prompt Adherence

Prompt Fid.

FrontierMath Tiers 1-3

FrontierMath

SimpleQA Verified

SimpleQA

OTIS Mock AIME 2024-2025

OTIS AIME

ARC-AGI 2

ARC-AGI 2

Terminal-Bench 2

TermBench 2

Frontier Composite

Frontier

Output Stability

Stability

Format Adherence

Format

Recovery Rate

Recovery

Safety Handling

Safety

Methodology matches the main AI Model Analyzer About page.