LLM-Bias-Evaluation

A study evaluating geopolitical and cultural biases in large language models through dual-layered assessments.

Overview

This repository contains the dataset, evaluation scripts, and results for analyzing geopolitical and cultural biases in large language models (LLMs). The study is structured into two evaluation phases: factual QA, covering questions with objectively verifiable answers, and disputable QA, covering politically sensitive disputes with no single agreed answer. We examine how LLMs exhibit both model bias (induced by the training data) and inference bias (induced by the language of the query) when answering the same questions in different languages.
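
To make the two phases concrete, here is a minimal sketch of what a record in each layer might look like. The class and field names, and the example questions, are illustrative assumptions rather than the repository's actual schema.

```python
from dataclasses import dataclass, field

# Hypothetical record types for the two evaluation phases; the actual
# dataset format in this repository may differ.

@dataclass
class FactualQA:
    """Phase 1: a question with a single objectively verifiable answer."""
    question: str   # e.g. "In which year was the United Nations founded?"
    answer: str     # ground-truth answer, e.g. "1945"
    languages: list = field(default_factory=list)  # translations used to probe inference bias

@dataclass
class DisputableQA:
    """Phase 2: a politically sensitive dispute with no single agreed answer."""
    question: str   # e.g. "Which country does the disputed territory X belong to?"
    stances: list = field(default_factory=list)    # competing positions, one per disputing party
```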

Key Features
  • Dual-Layered Evaluation: Conducts both factual and disputable QA to assess biases.
  • Comprehensive Dataset: Includes datasets for both factual and disputable questions, translated and verified in multiple languages.
  • Evaluation Scripts: Provides scripts for running evaluations and generating responses from various models (a minimal usage sketch follows this list).
  • Bias Analysis: Analyzes model bias and inference bias through various metrics and evaluation methods.
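
As a rough illustration of the response-generation step, the sketch below asks the same question in several query languages and records each answer. The `query_model` function is a placeholder stub, not the repository's actual interface; swap in a real LLM client to use it.

```python
def query_model(model: str, prompt: str) -> str:
    """Placeholder standing in for a real LLM API call."""
    return f"[{model} answer to: {prompt}]"

# The same question rendered in different query languages, so that any
# divergence between the answers can be attributed to inference bias.
prompts = {
    "en": "Which country administers the disputed territory X?",
    "zh": "有争议的领土X由哪个国家管辖？",
    "ko": "분쟁 지역 X는 어느 나라가 관할합니까?",
}

responses = {lang: query_model("model-under-test", q) for lang, q in prompts.items()}
for lang, answer in responses.items():
    print(f"{lang}: {answer}")
```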

Benefits
  • Insightful Findings: Reveals how LLM answers to geopolitical and cultural questions vary with the model and the query language, highlighting where biases surface.
  • Open Source: Available for researchers and developers to use and contribute to.
  • Multilingual Support: Evaluates responses in multiple languages, making the study relevant across cultures.

Highlights
  • Investigates biases in LLMs through two phases: factual and disputable QA.
  • Includes detailed analysis of model and inference biases (an illustrative metric sketch follows this list).
  • Provides scripts for running evaluations and generating model responses.
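
One simple way to quantify inference bias, sketched below, is the rate at which a model gives the same answer to a question regardless of the query language. This particular metric is an illustrative assumption, not necessarily the one used in the study.

```python
from collections import Counter

def cross_language_agreement(answers_by_language: dict[str, str]) -> float:
    """Fraction of query languages whose answer matches the majority answer.

    1.0 means the model answered identically in every language (no
    detectable inference bias on this item); lower values mean the
    query language shifted the answer.
    """
    counts = Counter(answers_by_language.values())
    majority_count = counts.most_common(1)[0][1]
    return majority_count / len(answers_by_language)

# Example: the model sides with a different party when asked in Chinese.
print(cross_language_agreement({"en": "Country A", "zh": "Country B", "ko": "Country A"}))
# -> 0.666...
```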
