Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
F-Eval is a bilingual evaluation benchmark for assessing fundamental abilities in AI models.
F-Eval is a bilingual evaluation benchmark designed to assess fundamental abilities in AI models, including expression, commonsense reasoning, and logic. It consists of 2,211 instances in both English and Chinese, providing a comprehensive dataset for evaluation.