Resolving real-world GitHub issues. Verified subset ensures solvable issues.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Latest Data
2026-05-21
Context Window
1.0M
tokens
Input Cost
$3.00
per 1M tokens
Output Cost
$15.00
per 1M tokens
Cache Cost
$0.30 / $3.75
read / write per 1M
Parameters
Unknown
model footprint
Performance Analysis // Verified Benchmarks
Resolving real-world GitHub issues. Verified subset ensures solvable issues.
Chatbot Arena ELO score. Crowd-sourced human preference ranking.
WebDev Arena ELO score. Human preference ranking for web development tasks.
Vision Arena ELO score. Human preference ranking for multimodal vision tasks.
Search Arena ELO score. Human preference ranking for search-augmented generation.
Document Arena ELO score. Human preference ranking for document understanding.
Humanity's Last Exam full evaluation with tool access enabled.
Graduate-Level Google-Proof Q&A Benchmark.
Abstraction and Reasoning Corpus - Level 2 (Extreme difficulty).