MMLU (5-shot)Knowledge
87.9*/ 100
Verified
Last Verified: Unknown DateArtificial Analysis (Independent)
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Latest Data
Unknown
Context Window
131k
tokens
Input Cost
$1.60
per 1M tokens
Output Cost
$6.40
per 1M tokens
Parameters
Unknown
model footprint
Performance Analysis // Verified Benchmarks
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Challenging competition mathematics problems (AIME/IMO level).
Functional correctness of synthesized programs from docstrings.
A more robust and harder version of MMLU, focusing on complex reasoning and STEM subjects.