Beta version: *Information might not be fully accurate. Please report any discrepancies.

Zhipu AIVerifiedOpen Weights7 benchmarks

GLM-4.6

Released 2025-09-30355B MoE Architecture

Training: 2024-07

Verified Model Card

Latest Data

2026-02-20

Context Window

200k

tokens

Input Cost

$0.10

per 1M tokens

Output Cost

$0.30

per 1M tokens

Parameters

355B MoE

model footprint

Benchmark Provenance

Performance Analysis // Verified Benchmarks

MMLU (5-shot)Knowledge

86.5/ 100

Verified

Last Verified: 2025-09-30Z.ai Blog

Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.

MATH (CoT)Math

71.5*/ 100

Verified

Last Verified: 2026-02-16Artificial Analysis (Independent)

Challenging competition mathematics problems (AIME/IMO level).

HumanEvalCoding

82*/ 100

Verified

Last Verified: 2026-02-16Artificial Analysis (Independent)

Functional correctness of synthesized programs from docstrings.

SWE-bench VerifiedCoding

65.3*/ 100

Verified

Last Verified: 2026-02-16Artificial Analysis (Independent)

Resolving real-world GitHub issues. Verified subset ensures solvable issues.

LiveBenchReasoning

55.19/ 100

Verified

Last Verified: 2026-02-20LiveBench

Contamination-free, continuously updated reasoning benchmark.

LMArena ELOReal-world

1385/ 1700

Verified

Last Verified: 2026-02-18Chatbot Arena Leaderboard

Chatbot Arena ELO score. Crowd-sourced human preference ranking.

GPQA DiamondSTEM

78.2/ 100

Verified

Last Verified: 2026-02-18LLM Stats

Graduate-Level Google-Proof Q&A Benchmark.