MMLU (5-shot)Knowledge
85.2/ 100
Verified
Last Verified: Unknown DateOpenAI Blog
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Latest Data
Unknown
Context Window
128k
tokens
Input Cost
$1.10
per 1M tokens
Output Cost
$4.40
per 1M tokens
Cache Cost
$0.55 / Free
read / write per 1M
Parameters
Small Reasoning
model footprint
Performance Analysis // Verified Benchmarks
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Functional correctness of synthesized programs from docstrings.
500-problem math benchmark for broad quantitative reasoning.
Graduate-Level Google-Proof Q&A Benchmark.