Search
Collection
Category
Tag
Blog
Pricing
Submit

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates

Email

AISecKit

Curated AI security tools & LLM safety resources for cybersecurity professionals

Product

Search
Collection
Category
Tag

Resources

Blog
Pricing
Submit

Tools

🔥Marathons Tools

Company

About Us
Privacy Policy
Terms of Service
Sitemap

Copyright © 2025 All Rights Reserved.

Home
Category
jailbreak_llms

jailbreak_llms

A dataset of 15,140 ChatGPT prompts, including 1,405 jailbreak prompts, collected from various platforms for research purposes.

image for jailbreak_llms

Introduction

Information

Publisher
AISecKit
Websitegithub.com
Published date2025/04/26

Categories

AI Ethics Resources
AI Research Papers
Jailbreak Prevention

Tags

AI Ethics
Model Robustness
Jailbreak Detection
Security Auditing
Open Source
Responsible AI

More Products

image of agentic-design-patterns-cn

AI Application PlatformsAI Research PapersAI Development Frameworks

agentic-design-patterns-cn

A bilingual Chinese-English translation of 'Agentic Design Patterns' by Antonio Gulli, focusing on intelligent systems design.

AI Reasoning Open Source AI Education AI Standards AI Communities+1

image of TradingAgents-CN

AI Application PlatformsAI Research PapersAI Development Frameworks

TradingAgents-CN

基于多智能体LLM的中文金融交易框架，支持A股/港股/美股分析。

Market Analysis Open Source LLM AI Communities Generative AI+1

LangFair

LangFair is a Python library for conducting use-case level LLM bias and fairness assessments.

Responsible AI LLM Bias Mitigation

Introduction

The jailbreak_llms project is a comprehensive dataset consisting of 15,140 ChatGPT prompts sourced from platforms like Reddit, Discord, and various websites. This dataset includes 1,405 jailbreak prompts, making it the largest collection of in-the-wild jailbreak prompts to date. The data was collected over a year, from December 2022 to December 2023, and is intended for research purposes, particularly in evaluating the effectiveness of jailbreak prompts on large language models (LLMs).

Key Features

Extensive Dataset: Over 15,000 prompts collected from diverse online platforms.
Jailbreak Focus: Specifically identifies and categorizes jailbreak prompts.
Research Utility: Aimed at understanding and mitigating risks associated with LLMs.
Ethical Considerations: Follows ethical guidelines to ensure responsible use of data.

Benefits

For Researchers: Provides a valuable resource for studying the vulnerabilities of LLMs.
For Developers: Helps in developing stronger safeguards against harmful prompts.
Awareness Raising: Informs the community about potential misuse of LLMs and encourages responsible AI development.

Highlights

Framework: Utilizes the JailbreakHub framework for measurement studies.
Evaluation: Includes a question set for evaluating the effectiveness of jailbreak prompts across various scenarios.
Open Source: Licensed under the MIT license, promoting collaboration and transparency.