Scenario-based large model testing toolbox for automating evaluations of large language models.
GreatLibrarian is a scenario-based large model testing toolbox designed to automate the evaluation of large language models (LLMs). Users supply an LLM's API key and test cases in JSON format, and the toolbox handles the evaluation process from there. The toolbox consists of several key modules:
Built in Python, GreatLibrarian allows easy integration and customization of scoring metrics, which makes it a good fit for developers and researchers who want to benchmark their LLMs against varied test scenarios.
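To make the idea of JSON test cases plus a pluggable scoring metric concrete, here is a minimal sketch of a scenario-based evaluation loop. It does not use GreatLibrarian's actual API or JSON schema: the field names (`scenario`, `prompt`, `keywords`), the functions `keyword_score` and `evaluate`, and the offline stand-in `fake_llm` are all assumptions made for illustration.

```python
import json
from typing import Callable, Dict, List

# Hypothetical test-case layout; GreatLibrarian's real JSON schema may differ.
TEST_CASES_JSON = """
[
  {"scenario": "arithmetic", "prompt": "What is 2 + 3?", "keywords": ["5"]},
  {"scenario": "geography", "prompt": "What is the capital of France?", "keywords": ["Paris"]}
]
"""


def keyword_score(answer: str, keywords: List[str]) -> float:
    """Return the fraction of expected keywords found in the answer (case-insensitive)."""
    if not keywords:
        return 0.0
    lowered = answer.lower()
    hits = sum(1 for kw in keywords if kw.lower() in lowered)
    return hits / len(keywords)


def evaluate(
    llm: Callable[[str], str],
    cases: List[dict],
    metric: Callable[[str, List[str]], float],
) -> Dict[str, float]:
    """Send every prompt to the model and average the metric per scenario."""
    scores: Dict[str, List[float]] = {}
    for case in cases:
        answer = llm(case["prompt"])
        scores.setdefault(case["scenario"], []).append(metric(answer, case["keywords"]))
    return {name: sum(vals) / len(vals) for name, vals in scores.items()}


def fake_llm(prompt: str) -> str:
    """Stand-in for a real API call so the sketch runs offline (assumption)."""
    canned = {"What is 2 + 3?": "The answer is 5."}
    return canned.get(prompt, "I am not sure.")


report = evaluate(fake_llm, json.loads(TEST_CASES_JSON), keyword_score)
print(report)
```

Because the metric is passed in as a plain function, swapping `keyword_score` for a different scoring rule needs no change to the evaluation loop; in practice `fake_llm` would be replaced by a function that calls the model's API with the user's key.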