Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
GitHub repository for optimization-based prompt injection attacks on LLMs as judges.

A bilingual Chinese-English translation of 'Agentic Design Patterns' by Antonio Gulli, focusing on intelligent systems design.

基于多智能体LLM的中文金融交易框架,支持A股/港股/美股分析。
JudgeDeceiver is an open-source tool developed for conducting optimization-based prompt injection attacks on large language models (LLMs) that function as judges. This tool was released alongside the paper presented at ACM CCS 2024, detailing methods to exploit prompt injection vulnerabilities in LLMs.
This repository not only serves as a tool for testing and enhancing model robustness but also acts as a resource for the ongoing research community focused on AI safety and security.