AMA

Head-to-head

Claude Sonnet 4 (Thinking) vs Gemini 3 Flash

Normalized scores are min-maxed per benchmark across all models we track (0–100). Open the interactive compare view to add benchmarks to the radar chart or pull in more models.

Claude Sonnet 4 (Thinking)
Anthropic
Gemini 3 Flash
Google
BenchmarkClaude Sonnet 4 (Thinking)Gemini 3 Flash
Chatbot Arena Elo

Arena

LiveCodeBench

LiveCB

SWE-bench Verified

SWE-bench

Output Speed

Speed

Time to First Token

TTFT

Rolling Contamination-Controlled Average

Rolling Avg

Rolling Data Analysis

Data Analysis

FrontierMath Tiers 1-3

FrontierMath

SimpleQA Verified

SimpleQA

OTIS Mock AIME 2024-2025

OTIS AIME

Humanity's Last Exam

HLE

ARC-AGI 2

ARC-AGI 2

Terminal-Bench 2

TermBench 2

Frontier Composite

Frontier

Methodology matches the main AI Model Analyzer About page.