HeadInfer is a memory-efficient inference framework for large language models that reduces GPU memory consumption.
100% free, private deep learning chatbot that uses retrieval-augmented generation.
:robot: The free, open-source alternative to OpenAI, Claude, and others. Self-hosted and local-first.
Chat with private and local large language models optimized for iOS devices.
Secure and local AI on your desktop with a built-in RAG knowledge base and Markdown note support.