GLM-4.6

Name: GLM-4.6
Brand: Zhipu
Price: 0.5 USD
Rating: 44.9 (5 reviews)

Open source

Self-hostable

Free to run

Zhipu AI (GLM)

Open license

text

GLM-4Released 9mo ago

Avg score

44.9

/ 100

Context

200k

Output limit

16k

Input price

$0.50 /M

Output price

$1.75 /M

Parameters

356.8B

Est. VRAM (Q4)

~214.1 GB

Pricing verified 2mo ago · Estimate; verify on z.ai or hosted-endpoint pricing pages before relying on this.

Benchmarks

preference

Chatbot Arena EloFresh

Elo

Crowdsourced pairwise human preference rankings of LLM responses. Higher Elo means more frequently preferred by users.

agentic

SWE-bench VerifiedSome risk

% resolved

Real GitHub issues solved end-to-end. Verified subset is a 500-task human-validated slice of SWE-bench.

Terminal-Bench 2Fresh

Long-horizon shell-and-filesystem tasks executed in a sandboxed terminal, scored by whether the agent's final state matches a target state. Tests practical tool-using ability for everyday devops and data-wrangling work; one of the hardest agentic benchmarks today.

math

FrontierMath Tiers 1-3Fresh

Mathematical research problems spanning analysis, algebra, combinatorics and number theory. Tiers 1-3 are progressively harder; even frontier reasoning models only solve a small fraction. The hardest publicly reported benchmark for general mathematical reasoning.

composite

Frontier CompositeFresh

ECI

Saturation-resistant composite capability score stitched together from ~40 underlying benchmarks using Item Response Theory. Each benchmark is weighted by its fitted difficulty and discriminative slope, so doing well on hard, contamination-resistant evals (FrontierMath, ARC-AGI 2, Humanity's Last Exam) moves the score and saturated benchmarks contribute almost nothing. Imported per-model from Epoch AI's published index; we anchor it to the same min-max scale we use for every other benchmark so it's directly weightable in scenarios.

Reliability monitor

Loading drift signal…

Hosted endpoints

No third-party hosts tracked for this model — available only from its primary provider.

Compare with...

vs GPT-4o vs GPT-4o mini vs o1 vs o1-mini vs o3 vs o4-mini vs o3-mini vs GPT-4 Turbo vs GPT-4.1 vs GPT-5