Beta version: Information might not be fully accurate. Please report any discrepancies.
Instruction Following Evaluation for Large Language Models. Measures how well a model follows strict formatting and constraint requirements stated in the prompt.
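As a rough illustration of what "strict formatting and constraint requirements" can mean in practice, the sketch below checks a response against a few constraints that can be verified programmatically. The function names, the choice of constraints, and the scoring helper are all hypothetical and are not taken from the benchmark's own code.

```python
import json


def check_max_words(response: str, limit: int) -> bool:
    """Hypothetical check: the response has at most `limit` whitespace-separated words."""
    return len(response.split()) <= limit


def check_all_lowercase(response: str) -> bool:
    """Hypothetical check: the response contains no uppercase letters."""
    return response == response.lower()


def check_valid_json(response: str) -> bool:
    """Hypothetical check: the whole response parses as JSON."""
    try:
        json.loads(response)
    except json.JSONDecodeError:
        return False
    return True


def fraction_followed(results: list[bool]) -> float:
    """Share of constraints that were satisfied (0.0 for an empty list)."""
    return sum(results) / len(results) if results else 0.0


if __name__ == "__main__":
    response = '{"answer": "paris"}'
    results = [
        check_max_words(response, 5),
        check_all_lowercase(response),
        check_valid_json(response),
    ]
    print(results, fraction_followed(results))
```

Checks of this kind are deterministic, so a score can be computed without a human or model judge; how the actual benchmark aggregates its scores may differ from this simple fraction.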
Score Distribution