Skip to content

Beta version: *Information might not be fully accurate. Please report any discrepancies.

Vision & Video

Evaluates visual understanding including image classification, object detection, video comprehension, and multimodal reasoning. Covers MMMU, VQA, video understanding, and cross-modal tasks.