Beta version: *Information might not be fully accurate. Please report any discrepancies.

MistralVerifiedOpen Weights7 benchmarks

Mistral Small 4

Released 2026-03-16119B total (6B active) Architecture

Verified Model Card

Latest Data

2026-03-16

Context Window

256k

tokens

Input Cost

$0.15

per 1M tokens

Output Cost

$0.60

per 1M tokens

Parameters

119B total (6B active)

model footprint

Benchmark Provenance

Performance Analysis // Verified Benchmarks

MMLU-ProScience

78/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

A more robust and harder version of MMLU, focusing on complex reasoning and STEM subjects.

LiveCodeBench v6Coding

63.6/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

Contamination-free coding benchmark using recent problems.

GPQA DiamondSTEM

71.2/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

Graduate-Level Google-Proof Q&A Benchmark.

AA-LCRLong Context

71.2/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

Artificial Analysis Long Context Reasoning benchmark. Evaluates reasoning over long contexts.

IFBenchInstruction Following

48/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

Artificial Analysis IFBench. Evaluates precise instruction following with constraints.

AIME 2025Math

83.8/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

American Invitational Mathematics Examination 2025 problems.

MMMU-ProVision

60/ 100

Verified

Last Verified: 2026-03-16Mistral Small 4 Announcement

Professional level MMMU expansion.