Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
Scenario-based large model testing toolbox for automating evaluations of large language models.

Nano Bananary is an AI batch image and video generator with 142 effects.

AI Podcast Generator for bilingual episodes, supporting multiple languages and alternative to NotebookLLM.

Zero-Config Code Flow for Claude code & Codex, enabling seamless integration and configuration for AI development.
GreatLibrarian is a scenario-based large model testing toolbox designed to automate the evaluation of large language models (LLMs). Users can provide an LLM's API key and test cases in JSON format to facilitate the entire evaluation process. The toolbox consists of several key modules:
With an architecture built in Python, GreatLibrarian allows for easy integration and customization of scoring metrics, providing an ideal solution for developers and researchers looking to benchmark their LLMs against varied test scenarios.