Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
This repository contains the code for generating the ToxiGen dataset for hate speech detection.
ToxiGen is a large-scale machine-generated dataset designed for adversarial and implicit hate speech detection, published at ACL 2022. This repository includes the necessary code and tools to generate the ToxiGen dataset, which contains implicitly toxic and benign sentences mentioning 13 minority groups. The dataset aims to train classifiers to detect subtle hate speech that does not include slurs or profanity.