LogoAISecKit
  • Search
  • Collection
  • Category
  • Tag
  • Blog
  • Pricing
  • Submit
LogoAISecKit

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates

LogoAISecKit

Curated directory of 1700+ AI tools, models, frameworks, MCP servers, and cybersecurity resources

GitHub
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
  • Pricing
  • Submit
Company
  • About Us
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2026 All Rights Reserved.
Sponsored Resources
  1. Home
  2. Category
  3. Awesome LLMs Evaluation Papers
icon of Awesome LLMs Evaluation Papers

Awesome LLMs Evaluation Papers

A comprehensive collection of papers focused on evaluating large language models (LLMs).

Visit Website
image for Awesome LLMs Evaluation Papers
Visit Website

Introduction

Awesome LLMs Evaluation Papers

This repository provides a curated list of papers organized according to the survey Evaluating Large Language Models: A Comprehensive Survey.

Key Features
  • Comprehensive coverage of evaluation methodologies across various aspects of LLMs.
  • Categorized papers including Knowledge and Capability Evaluation, Alignment Evaluation, and Safety Evaluation.
  • Includes benchmarks and leaderboards for LLM performance.
  • Regular updates with new research contributions.
Benefits
  • Serves as a valuable resource for researchers and practitioners in the field of AI and LLMs.
  • Facilitates a better understanding of the capabilities and risks associated with large language models.
  • Promotes community involvement in maintaining and expanding the paper list.
Highlights
  • Authors include recognized contributors from Tianjin University and other institutions.
  • Encourages citation and feedback to enhance the resource.
Back

Information

  • Publisher
    AISecKit
  • Websitegithub.com
  • Published date2025/04/28

Categories

  • AI Ethics Resources
  • AI Research Papers

Tags

  • AI Ethics
  • Foundation Models
  • AI Alignment
  • Research Papers
  • Model Evaluation
  • Bias Mitigation

More Products

image of agentic-design-patterns-cn
AI Application PlatformsAI Research PapersAI Development Frameworks
Visit Website
icon of agentic-design-patterns-cn

agentic-design-patterns-cn

A bilingual Chinese-English translation of 'Agentic Design Patterns' by Antonio Gulli, focusing on intelligent systems design.

AI ReasoningOpen SourceAI EducationAI StandardsAI Communities+1
image of TradingAgents-CN
AI Application PlatformsAI Research PapersAI Development Frameworks
Visit Website
icon of TradingAgents-CN

TradingAgents-CN

基于多智能体LLM的中文金融交易框架,支持A股/港股/美股分析。

Market AnalysisOpen SourceLLMAI CommunitiesGenerative AI+1
L
AI ModelsAI Application PlatformsAI Ethics Resources
Visit Website
icon of LangFair

LangFair

LangFair is a Python library for conducting use-case level LLM bias and fairness assessments.

Responsible AILLMBias Mitigation