Newsletter
Join the Community
Subscribe to our newsletter for the latest news and updates
SGLang is a fast serving framework for large language models and vision language models.
SGLang is a fast serving framework designed for large language models (LLMs) and vision language models. It enhances interaction with models by co-designing the backend runtime and frontend language, making it faster and more controllable.