Beta version: *Information might not be fully accurate. Please report any discrepancies.

MistralVerifiedOpen Weights8 benchmarks

Mistral Medium 3.5

Released 2026-04-01128B Architecture

Verified Model Card

Latest Data

2026-04-01

Context Window

256k

tokens

Input Cost

$1.50

per 1M tokens

Output Cost

$7.50

per 1M tokens

Parameters

128B

model footprint

Benchmark Provenance

Performance Analysis // Verified Benchmarks

SWE-bench VerifiedCoding

77.6/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

Resolving real-world GitHub issues. Verified subset ensures solvable issues.

MATH-500Math

90/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

500-problem math benchmark for broad quantitative reasoning.

LiveCodeBench v6Coding

55.1/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

Contamination-free coding benchmark using recent problems.

GPQA DiamondSTEM

76.6/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

Graduate-Level Google-Proof Q&A Benchmark.

AIME 2025Math

72.8/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

American Invitational Mathematics Examination 2025 problems.

Aider PolyglotAgentic

68.4/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

Multi-language coding agent benchmark with editor-in-the-loop tasks.

TAU-Bench RetailAgentic

76.5/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

Retail-domain tool-use and workflow benchmark from τ²-bench.

TAU-Bench TelecomAgentic

91.4/ 100

Verified

Last Verified: 2026-04-01Mistral Medium 3.5 Announcement

Telecom-domain tool-use and workflow benchmark.