Beta version: *Information might not be fully accurate. Please report any discrepancies.
Beta version: *Information might not be fully accurate. Please report any discrepancies.
Factuality benchmark across grounding, parametric, search, and multimodal.
Score Distribution