A pytest plugin for running and analyzing LLM evaluation tests.

pytest-evals is a minimal pytest plugin that helps developers run and analyze evaluation tests for Large Language Models (LLMs). It simplifies checking LLM outputs against predefined examples, so you can track whether a model keeps performing as expected over time.
Install it with pip:
pip install pytest-evals
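
To make the idea of checking outputs against predefined examples concrete, here is a minimal sketch in plain pytest style. It deliberately does not use the pytest-evals API, which is not described here. The names `classify`, `CASES`, `run_cases`, and `accuracy` are hypothetical, and `classify` is a keyword-based stand-in for a real LLM call.

```python
# Hypothetical stand-in for a real LLM call; swap in your own client.
def classify(text: str) -> bool:
    """Return True if the text looks like a computer-related question."""
    keywords = ("computer", "laptop", "keyboard", "software")
    return any(word in text.lower() for word in keywords)


# Predefined examples: (input text, expected label). Illustrative data only.
CASES = [
    ("How do I restart my computer?", True),
    ("My laptop will not turn on.", True),
    ("What is the weather today?", False),
    ("Recommend a pasta recipe.", False),
]


def run_cases(cases):
    """Run every case and record the expected and predicted labels."""
    return [
        {"input": text, "expected": expected, "predicted": classify(text)}
        for text, expected in cases
    ]


def accuracy(results):
    """Fraction of cases where the prediction equals the expectation."""
    correct = sum(r["predicted"] == r["expected"] for r in results)
    return correct / len(results)


def test_classifier_accuracy():
    # pytest collects this function by its test_ prefix.
    # The 0.75 threshold is an arbitrary choice for this sketch.
    results = run_cases(CASES)
    assert accuracy(results) >= 0.75
```

Saving this as a file such as test_classifier.py and running pytest on it would execute the check. Asserting on an aggregate score rather than on each case is one possible design choice: individual LLM outputs can vary between runs, so a threshold over the whole example set tends to be a more stable signal.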