LogoAISecKit
  • Search
  • Collection
  • Category
  • Tag
  • Blog
  • Pricing
  • Submit
LogoAISecKit

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates

LogoAISecKit

Curated directory of 1700+ AI tools, models, frameworks, MCP servers, and cybersecurity resources

GitHub
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
  • Pricing
  • Submit
Company
  • About Us
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2026 All Rights Reserved.
Sponsored Resources
  1. Home
  2. Category
  3. Evaluation-Multimodal-LLMs-Survey
icon of Evaluation-Multimodal-LLMs-Survey

Evaluation-Multimodal-LLMs-Survey

A comprehensive survey on benchmarks for Multimodal Large Language Models (MLLMs).

Visit Website
image for Evaluation-Multimodal-LLMs-Survey
Visit Website

Introduction

Evaluation of Multimodal Large Language Models (MLLMs)

This repository presents a detailed survey on the benchmarks of Multimodal Large Language Models (MLLMs), focusing on their performance across various applications such as visual question answering, visual perception, understanding, and reasoning. The survey reviews over 200 benchmarks and evaluations, categorized into key areas:

Key Features:
  • Comprehensive Evaluation: In-depth analysis of MLLMs from multiple perspectives including perception, cognition, and reasoning.
  • Diverse Applications: Covers applications in specific domains such as healthcare, autonomous driving, and more.
  • Future Directions: Discusses limitations of current evaluation methods and explores promising future research directions.
Benefits:
  • Research Collaboration: Encourages collaboration on academic research and writing papers.
  • Active Maintenance: The repository will be regularly updated with new research findings.
Highlights:
  • Focus on key capabilities like conversation abilities, hallucination, and trustworthiness.
  • Exploration of various modalities including videos, audio, and 3D points.

For more information, visit the GitHub repository.

Back

Information

  • Publisher
    AISecKit
  • Websitegithub.com
  • Published date2025/04/28

Categories

  • AI Models
  • AI Application Platforms
  • AI Research Papers

Tags

  • Foundation Models
  • Multimodal LLMs
  • AI Reasoning
  • Model Evaluation

More Products

image of Nano Bananary
AI ModelsAI Application PlatformsAI Video Tools
Visit Website
icon of Nano Bananary

Nano Bananary

Nano Bananary is an AI batch image and video generator with 142 effects.

Text-to-VideoGenerative AI
image of Twocast
AI Application PlatformsAI Productivity ToolsAI Audio Tools
Visit Website
icon of Twocast

Twocast

AI Podcast Generator for bilingual episodes, supporting multiple languages and alternative to NotebookLLM.

Content Creation
image of ZCF
AI Application PlatformsAI Productivity ToolsAI Development Frameworks
Visit Website
icon of ZCF

ZCF

Zero-Config Code Flow for Claude code & Codex, enabling seamless integration and configuration for AI development.

Open SourceClaude