Beta version: *Information might not be fully accurate. Please report any discrepancies.
Vision & Video
Evaluates visual understanding including image classification, object detection, video comprehension, and multimodal reasoning. Covers MMMU, VQA, video understanding, and cross-modal tasks.
Top Models
Domain Info
- Benchmarks
- 88
- Models Evaluated
- 34
- Categories
- Vision, Video, Multimodal