Skip to content

Beta version: *Information might not be fully accurate. Please report any discrepancies.

Coding

Tests programming proficiency across multiple languages, software engineering tasks, debugging capabilities, and real-world coding scenarios. Includes HumanEval, MBPP, SWE-bench, and competitive programming benchmarks.