Beta version: *Information might not be fully accurate. Please report any discrepancies.

AA-LCR

lcrLong ContextHigher is better

Artificial Analysis Long Context Reasoning benchmark. Evaluates reasoning over long contexts.

Loading chart…

Loading leaderboard…

Benchmark Info

Current SOTA

Best Open Source

Median Score58.7%

DistributionClustered

Score Distribution

P10

P50

P90

0100%

MRCR v2

mrcr-v2

LongBench v2

longbench-v2

Graphwalks Bfs

graphwalks-bfs