Beta version: *Information might not be fully accurate. Please report any discrepancies.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Multimodal browse + synthesize benchmark for web agents.
Score Distribution