A curated list of tools, datasets, demos, and papers for evaluating large language models (LLMs).

Awesome-LLM-Eval is a curated repository of resources for evaluating large language models (LLMs). It covers tools, datasets, benchmarks, demos, leaderboards, papers, and documentation, with the aim of exploring the boundaries of generative AI technology.
Explore the repository to enhance your understanding and evaluation of large language models!