Terminal-Bench 2.0Agentic
65.8/ 100
Verified
Last Verified: 2026-04-27Xiaomi MiMo
Agent performance in realistic terminal workflows (v2.0 leaderboard).
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Latest Data
2026-04-27
Context Window
1.0M
tokens
Input Cost
$0.40
per 1M tokens
Output Cost
$2.00
per 1M tokens
Cache Cost
$0.08 / Free
read / write per 1M
Parameters
310B MoE (15B activated)
model footprint
Performance Analysis // Verified Benchmarks
Agent performance in realistic terminal workflows (v2.0 leaderboard).
Benchmark for daily agentic tasks across text and multimodal interactions.
Higher-difficulty SWE-bench subset for frontier coding agents.