Everything you need to know to build your own RAG application.
FlashMLA is an efficient MLA decoding kernel optimized for Hopper GPUs, delivering significant performance improvements.
AgentSociety is a framework for large-scale social simulation using LLM-driven agents to model human behaviors and society.
A Next.js project template for building conversational web applications.
A GitHub repository exploring LLMs as coding tutors with a focus on dialogue tutoring agents.
Experience email the way you want with 0 – the first open-source email app that puts your privacy and safety first.
HeadInfer is a memory-efficient inference framework for large language models that reduces GPU memory consumption.
A knowledge-sharing platform on large language models, aimed at job-interview preparation and general understanding.
A React component that renders Markdown as visually appealing social media images, with a built-in web editor.
A free and open-source resume builder focused on privacy, customizability, and ease of use.
A powerful Python script for macOS that leverages mdfind for fast file searching.
Claude Code is an agentic coding tool that helps you code faster by executing tasks and explaining code in response to natural-language commands.