Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
JailBench is a comprehensive Chinese dataset for assessing jailbreak attack risks in large language models.
JailBench is a large-scale dataset designed to evaluate the jailbreak attack risks of large language models in the Chinese context. It is aligned with the national cybersecurity standards and aims to provide a thorough assessment of the security vulnerabilities in AI-generated content.