AISecKit

Curated directory of 1700+ AI tools, models, frameworks, MCP servers, and cybersecurity resources

Copyright © 2026 All Rights Reserved.

TOXIGEN

This repository contains the code for generating the ToxiGen dataset for hate speech detection.

Introduction

ToxiGen

ToxiGen is a large-scale, machine-generated dataset for adversarial and implicit hate speech detection, published at ACL 2022. The repository provides the code and tools used to generate the dataset, which contains implicitly toxic and benign sentences mentioning 13 minority groups. The dataset is intended for training classifiers to detect subtle hate speech that contains no slurs or profanity.

Key Features:
  • Dataset Generation: Code for generating the ToxiGen dataset using pretrained language models like GPT-3.
  • ALICE Tool: A tool to stress test content moderation systems and improve their performance across minority groups.
  • Human Annotations: Includes 27,450 human annotations for better dataset quality and reliability.
  • Community Contributions: Encourages users to contribute new prompts and data generation methods.
  • Pretrained Classifiers: Provides checkpoints for HateBERT and RoBERTa models fine-tuned on ToxiGen data.
Benefits:
  • Research Utility: Designed for research purposes to improve toxicity detection methods.
  • Open Source: Available for community contributions and enhancements.
  • Responsible AI Considerations: Acknowledges the complexities of problematic language and encourages multidisciplinary research.
Highlights:
  • Source code and prompt seeds are released to foster community engagement.
  • Available on HuggingFace for easy access and integration into projects.
  • Documentation and examples are provided so that users can get started quickly.
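
The stress-testing idea behind the features above can be illustrated with a small sketch: score labeled statements with a toxicity classifier and break the errors down by the group each statement mentions. This is not the repository's code or the ALICE tool. The names `Example`, `per_group_error_rates`, and `keyword_baseline` are hypothetical, the sample statements are placeholders rather than ToxiGen data, and the keyword classifier is a toy stand-in for a real model.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, NamedTuple, Optional


class Example(NamedTuple):
    """One labeled statement (hypothetical schema, not the ToxiGen format)."""
    text: str
    group: str      # group mentioned in the statement
    is_toxic: bool  # gold label, e.g. from human annotation


# A classifier maps a text to a toxicity score in [0, 1].
Classifier = Callable[[str], float]


def per_group_error_rates(
    examples: Iterable[Example],
    classify: Classifier,
    threshold: float = 0.5,
) -> Dict[str, Dict[str, Optional[float]]]:
    """Compute false positive and false negative rates for each group."""
    counts = defaultdict(lambda: {"fp": 0, "fn": 0, "benign": 0, "toxic": 0})
    for ex in examples:
        predicted_toxic = classify(ex.text) >= threshold
        c = counts[ex.group]
        if ex.is_toxic:
            c["toxic"] += 1
            c["fn"] += int(not predicted_toxic)
        else:
            c["benign"] += 1
            c["fp"] += int(predicted_toxic)

    rates = {}
    for group, c in counts.items():
        rates[group] = {
            # A rate is None when the group has no examples of that class.
            "false_positive_rate": c["fp"] / c["benign"] if c["benign"] else None,
            "false_negative_rate": c["fn"] / c["toxic"] if c["toxic"] else None,
        }
    return rates


def keyword_baseline(text: str) -> float:
    """Toy stand-in classifier: flags text only if it contains a listed word."""
    blocklist = {"hate", "stupid"}
    return 1.0 if any(word in blocklist for word in text.lower().split()) else 0.0


# Placeholder statements; real evaluations would use annotated data instead.
EXAMPLES = [
    Example("members of group_a hate being stereotyped", "group_a", False),
    Example("group_a families celebrate many holidays", "group_a", False),
    Example("[implicitly toxic statement about group_b, no slurs]", "group_b", True),
]

if __name__ == "__main__":
    for group, rates in per_group_error_rates(EXAMPLES, keyword_baseline).items():
        print(group, rates)
```

The keyword baseline wrongly flags a benign statement that happens to contain a listed word and misses the implicit placeholder entirely, which is the kind of failure implicit hate speech data is meant to expose. To evaluate a real model, `classify` could wrap a Hugging Face `transformers` text-classification pipeline loaded from one of the fine-tuned checkpoints; the model identifier and label names should be taken from the checkpoint's model card and are not assumed here.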

Information

  • Publisher: AISecKit
  • Website: github.com
  • Published date: 2025/04/28

Categories

  • AI Models
  • AI Application Platforms
  • AI Ethics Resources

Tags

  • Synthetic Data
  • Open Source
  • Responsible AI
  • Content Moderation
