Beta version: Information might not be fully accurate. Please report any discrepancies.
Intelligence
Measures advanced cognitive capabilities, including logical reasoning, scientific knowledge, multi-step problem solving, and the ability to tackle novel challenges. Includes benchmarks such as GPQA and ARC-AGI, along with other frontier reasoning tasks.
Domain Info
- Benchmarks: 35
- Models Evaluated: 69
- Categories: Reasoning, Science, STEM, Advanced Tasks