Tag
Explore by tags

LLM 应用开发实践笔记
A comprehensive guide on LLM applications, covering LangChain, LlamaIndex, and HuggingGPT for developers.

LLMForEverybody
A knowledge-sharing platform about large language models for job interviews and general understanding.

so-large-lm
A comprehensive open-source tutorial on large-scale pre-trained language models covering theory and practical applications.

DeepEval
An open-source LLM evaluation framework for testing and evaluating large language model outputs.

CValues
A research project assessing and aligning the values of Chinese large language models focusing on safety and responsibility.

LettuceDetect
LettuceDetect is a hallucination detection framework for RAG applications.

Chinese LLM Benchmark
一个包含213个中文大模型的评测平台,提供评分排行榜和模型输出结果。

pytest-evals
A pytest plugin for running and analyzing LLM evaluation tests.

Opik
Open-source platform for debugging, evaluating, and monitoring LLM applications with comprehensive tracing and automated evaluations.

Mistral Cookbook
A collaborative repository for sharing examples showcasing Mistral models and tools developed by the community.

LangGraph-Reflection
A reflection agent that uses a two-agent system to validate and improve outputs in AI applications.
