Beta version: *Information might not be fully accurate. Please report any discrepancies.
Multi-Round Context Retrieval - 8-needle test.
Score Distribution
LongBench v2
longbench-v2
AA-LCR
lcr
Graphwalks Bfs
graphwalks-bfs