A curated list of useful resources that cover Offensive AI.
A Python toolbox for adversarial robustness research, implemented in PyTorch.
A Python library designed to enhance machine learning security against adversarial threats.
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX.
An adversarial example library for constructing attacks, building defenses, and benchmarking both.
A prompt injection game to collect data for robust ML research.
This paper discusses new methods for generating transferable adversarial attacks on aligned language models, improving LLM security.