Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
A unified evaluation framework for large language models.
PromptBench is a Pytorch-based Python package designed for the evaluation of Large Language Models (LLMs). It provides user-friendly APIs for researchers to conduct evaluations on LLMs efficiently. Here are some key features and benefits:
For more information, visit the GitHub repository.