Your Ultimate AI Security Toolkit
Curated AI security tools & LLM safety resources for cybersecurity professionals

AdaptixC2 is an extensible post-exploitation framework for penetration testers, supporting multi-platform operations.

An experimental high-performance DNS query bruteforce tool built with AF_XDP for bulk DNS lookups.
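
The core idea behind a bulk DNS brute-forcer is simple even without AF_XDP: generate candidate names from a wordlist and resolve them with bounded concurrency. A minimal concept sketch using only the standard library (this illustrates the general technique, not this tool's implementation; function names are hypothetical):

```python
import asyncio
from itertools import product

def candidates(domains, wordlist):
    """Yield fully qualified subdomain candidates for a brute-force sweep."""
    for word, domain in product(wordlist, domains):
        yield f"{word}.{domain}"

async def resolve(name, loop):
    """Resolve one name; return (name, addresses) or (name, None) on failure."""
    try:
        infos = await loop.getaddrinfo(name, None)
        return name, sorted({info[4][0] for info in infos})
    except OSError:
        return name, None

async def sweep(domains, wordlist, concurrency=100):
    """Resolve every candidate, keeping at most `concurrency` queries in flight."""
    loop = asyncio.get_running_loop()
    sem = asyncio.Semaphore(concurrency)

    async def bounded(name):
        async with sem:
            return await resolve(name, loop)

    return await asyncio.gather(*(bounded(n) for n in candidates(domains, wordlist)))
```

AF_XDP-based tools get their speed by bypassing this resolver path entirely and crafting raw DNS packets in user space, but the enumeration logic is the same.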

Python SDK for AI agent monitoring, LLM cost tracking, benchmarking, and more.
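
LLM cost tracking typically boils down to accumulating prompt/completion token counts per model and multiplying by per-token prices. A minimal sketch of that idea (the price table and class are illustrative assumptions, not this SDK's API):

```python
# Hypothetical per-1K-token prices (prompt, completion); real trackers
# load these from the provider's published price sheet.
PRICES = {"small-model": (0.0005, 0.0015), "large-model": (0.01, 0.03)}

class CostTracker:
    """Accumulate token usage per model and report estimated spend."""

    def __init__(self, prices):
        self.prices = prices
        self.usage = {}  # model -> [prompt_tokens, completion_tokens]

    def record(self, model, prompt_tokens, completion_tokens):
        used = self.usage.setdefault(model, [0, 0])
        used[0] += prompt_tokens
        used[1] += completion_tokens

    def total_cost(self):
        total = 0.0
        for model, (p_tok, c_tok) in self.usage.items():
            p_price, c_price = self.prices[model]
            total += p_tok / 1000 * p_price + c_tok / 1000 * c_price
        return total

tracker = CostTracker(PRICES)
tracker.record("small-model", 2000, 1000)  # estimated cost: 0.001 + 0.0015
```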

Python SDK and framework for AI agent observability, monitoring, and evaluation.

A curated list of awesome LLM agent frameworks.

A comprehensive library for implementing LLMs with a unified training pipeline and model evaluation.

A guidebook sharing insights and knowledge about evaluating Large Language Models (LLMs).

Latitude is the open-source prompt engineering platform to build, evaluate, and refine your prompts with AI.

A comprehensive collection of papers focused on evaluating large language models (LLMs).

Automatically evaluate your LLMs in Google Colab with LLM AutoEval.

Official GitHub repository for SafetyBench, a benchmark to evaluate the safety of large language models (LLMs).
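
Multiple-choice safety benchmarks are scored by comparing predicted choices against gold choices, usually with a per-category breakdown. A minimal scorer sketch (the tuple-based `results` format here is an assumption for illustration, not SafetyBench's actual schema):

```python
from collections import defaultdict

def score(results):
    """results: iterable of (category, predicted_choice, gold_choice).

    Returns (overall_accuracy, {category: accuracy}).
    """
    per_cat = defaultdict(lambda: [0, 0])  # category -> [correct, total]
    for category, pred, gold in results:
        per_cat[category][1] += 1
        per_cat[category][0] += pred == gold
    overall = sum(c for c, _ in per_cat.values()) / sum(t for _, t in per_cat.values())
    return overall, {cat: c / t for cat, (c, t) in per_cat.items()}

results = [("offensiveness", "A", "A"), ("offensiveness", "B", "C"),
           ("privacy", "D", "D"), ("privacy", "B", "B")]
overall, by_cat = score(results)
```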

Self-evaluating interview for AI coders.