MMLU (5-shot)Knowledge
81*/ 100
Verified
Last Verified: Unknown DateArtificial Analysis (Independent)
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Latest Data
Unknown
Context Window
32k
tokens
Input Cost
Free
per 1M tokens
Output Cost
Free
per 1M tokens
Parameters
34B
model footprint
Performance Analysis // Verified Benchmarks
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Challenging competition mathematics problems (AIME/IMO level).
Functional correctness of synthesized programs from docstrings.
Multi-discipline Multimodal Understanding and Reasoning.
Chatbot Arena ELO score. Crowd-sourced human preference ranking.
Graduate-Level Google-Proof Q&A Benchmark.