
LangChain is a framework for building LLM-powered applications, simplifying AI application development.

MedRAX is a versatile AI agent for integrated chest X-ray analysis and medical reasoning.

An experimental project exploring the use of Large Language Models (LLMs) to solve HackTheBox machines autonomously.

Everything you need to know to build your own RAG application.

FlashMLA is an efficient MLA decoding kernel optimized for Hopper GPUs, delivering significant performance improvements.

A GitHub repository exploring LLMs as coding tutors with a focus on dialogue tutoring agents.

HeadInfer is a memory-efficient inference framework for large language models that reduces GPU memory consumption.

A knowledge-sharing platform about large language models for job interviews and general understanding.

LLM API management & key redistribution system for various AI models, supporting unified API access and easy deployment.