
Faster Whisper transcription with CTranslate2.

A high-throughput and memory-efficient inference and serving engine for LLMs.

Replace OpenAI GPT with another LLM in your app by changing a single line of code.

:robot: The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first.

SGLang is a fast serving framework for large language models and vision language models.

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation.

R1-Onevision is a visual language model capable of deep CoT reasoning.

A prompt optimizer tool that helps in writing high-quality prompts for AI models.

A comprehensive open-source tutorial on large-scale pre-trained language models covering theory and practical applications.

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs with zero-code CLI and Web UI.

A simple technical tutorial project focusing on explaining interesting and cutting-edge technology concepts in under 5 minutes.