Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Latest Data
Unknown
Context Window
200k
tokens
Input Cost
$3.00
per 1M tokens
Output Cost
$15.00
per 1M tokens
Parameters
175B (Estimated)
model footprint
Performance Analysis // Verified Benchmarks
Massive Multitask Language Understanding covers 57 subjects across STEM, the humanities, social sciences, and more.
Functional correctness of synthesized programs from docstrings.
Resolving real-world GitHub issues. Verified subset ensures solvable issues.
Multi-discipline Multimodal Understanding and Reasoning.
Next-generation HumanEval with more diverse library calls and complex tasks.
Chatbot Arena ELO score. Crowd-sourced human preference ranking.
Comprehensive framework to evaluate LLMs as agents across diverse environments.
Graduate-Level Google-Proof Q&A Benchmark.
Expert-level chemistry knowledge and reasoning.