Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
A fork of llama.cpp with enhancements for performance and state-of-the-art quantization methods.

Nano Bananary is an AI batch image and video generator with 142 effects.

AI Podcast Generator for bilingual episodes, supporting multiple languages and alternative to NotebookLLM.

Zero-Config Code Flow for Claude code & Codex, enabling seamless integration and configuration for AI development.
ik_llama.cpp is a optimized fork of the original llama.cpp framework, providing enhanced performance and improved CPU matrix multiplications for various quantization types. It implements advanced techniques for prompt processing and token generation, leveraging powerful capabilities of CPUs like Ryzen-7950X and M2-Max.