A comprehensive survey on benchmarks for Multimodal Large Language Models (MLLMs).

This repository presents a survey of benchmarks for Multimodal Large Language Models (MLLMs), covering their performance on applications such as visual question answering, visual perception, understanding, and reasoning. The survey reviews over 200 benchmarks and evaluations, organized into several key areas.
For more information, visit the GitHub repository.