Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
Sample notebooks and prompts for evaluating large language models (LLMs) and generative AI.

Nano Bananary is an AI batch image and video generator with 142 effects.

AI Podcast Generator for bilingual episodes, supporting multiple languages and alternative to NotebookLLM.

Zero-Config Code Flow for Claude code & Codex, enabling seamless integration and configuration for AI development.
The LLM Evaluation repository provides a collection of sample notebooks and prompts designed for evaluating large language models (LLMs) and generative AI systems. This resource is particularly useful for researchers and practitioners looking to understand and assess the performance of LLMs in various contexts.