Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Evals is an open-source framework designed for evaluating large language models (LLMs) and LLM systems. It provides a comprehensive registry of benchmarks and allows users to create custom evaluations tailored to their specific use cases.