An open-source LLM evaluation framework for testing and evaluating large language model outputs.

DeepEval is a simple-to-use, open-source evaluation framework for testing and evaluating the outputs of large language models (LLMs). It aims to be a specialized unit-testing tool, similar to Pytest but tailored to LLM applications.
DeepEval equips developers and researchers alike with powerful tools to ensure their LLM systems meet high standards of performance and relevance.
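
To make the "Pytest for LLMs" idea concrete, here is a minimal sketch of what a unit test over an LLM output can look like. It deliberately does not use DeepEval's own API: `ToyTestCase`, `keyword_coverage`, and `assert_llm_output` are hypothetical names invented for this illustration, and the keyword-coverage score is a toy stand-in for the kind of metric a real evaluation framework would provide.

```python
from dataclasses import dataclass


@dataclass
class ToyTestCase:
    """Hypothetical container for one prompt/response pair (not a DeepEval class)."""
    prompt: str
    actual_output: str
    expected_keywords: list[str]


def keyword_coverage(case: ToyTestCase) -> float:
    """Toy metric: fraction of expected keywords found in the output (case-insensitive)."""
    if not case.expected_keywords:
        return 1.0
    text = case.actual_output.lower()
    hits = sum(1 for kw in case.expected_keywords if kw.lower() in text)
    return hits / len(case.expected_keywords)


def assert_llm_output(case: ToyTestCase, threshold: float = 0.5) -> None:
    """Fail like an ordinary unit test when the score falls below the threshold."""
    score = keyword_coverage(case)
    assert score >= threshold, f"coverage {score:.2f} below threshold {threshold:.2f}"


def test_refund_answer():
    # In a real suite, actual_output would come from the LLM application under test.
    case = ToyTestCase(
        prompt="What is your refund policy?",
        actual_output="You can request a full refund within 30 days of purchase.",
        expected_keywords=["refund", "30 days"],
    )
    assert_llm_output(case, threshold=1.0)
```

Because `test_refund_answer` is a plain function with an `assert` inside, a runner such as Pytest would collect and report it like any other test. The only difference from ordinary unit testing is that the pass/fail decision comes from a scored metric compared against a threshold rather than from an exact-match comparison.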