A pytest plugin for running and analyzing LLM evaluation tests.
QA-Pilot is an interactive chat project that leverages online/local LLM for rapid understanding and navigation of GitHub code repository.
Open-source platform for debugging, evaluating, and monitoring LLM applications with comprehensive tracing and automated evaluations.
Pragmatic framework to build LLM Copilots.
Automated web vulnerability scanning with LLM agents.
A reflection agent that uses a two-agent system to validate and improve outputs in AI applications.
A repository of common questions and answers for large model algorithm interviews.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
A curated list of 120+ LLM libraries categorized for various applications and frameworks.
A framework for optimizing prompts with a self-evolving mechanism for better task performance.
Open source LLM engineering platform for observability, metrics, evals, and prompt management.