LogoAISecKit
  • Search
  • Collection
  • Category
  • Tag
  • Blog
  • Pricing
  • Submit
LogoAISecKit

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates

LogoAISecKit

Curated directory of 1700+ AI tools, models, frameworks, MCP servers, and cybersecurity resources

GitHub
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
  • Pricing
  • Submit
Company
  • About Us
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2026 All Rights Reserved.
Sponsored Resources
  1. Home
  2. Category
  3. llama-swap
icon of llama-swap

llama-swap

Model swapping for llama.cpp or any local OpenAPI compatible server, providing automatic model management.

Visit Website
image for llama-swap
Visit Website

Introduction

llama-swap

llama-swap is a lightweight, transparent proxy server designed for automatic model swapping with llama.cpp or any local OpenAPI compatible server. Written in Golang, it is easy to install and configure, requiring only a single binary and a simple YAML configuration file.

Key Features:
  • Automatic Model Swapping: Automatically replaces the upstream server with the correct one based on the model requested.
  • Simple Configuration: Uses a single YAML file for configuration, making it user-friendly.
  • Multiple Model Support: Can handle multiple models simultaneously through profiles.
  • Docker Support: Easily deployable using Docker, with pre-built images available.
  • Health Monitoring: Includes health checks and logging capabilities for monitoring server status.
Benefits:
  • Flexibility: Works with any OpenAI compatible server, not just llama-server.
  • Performance Optimization: Supports speculative decoding and code generation optimization for improved inference speeds.
  • Resource Management: Provides control over system resources and automatic unloading of models after a specified timeout.
Highlights:
  • Supports various OpenAI API endpoints including completions, chat completions, embeddings, and more.
  • Easy to deploy on bare metal or via Docker, with pre-built binaries available for multiple operating systems.
  • Community-driven with active contributions and updates.
Back

Information

  • Publisher
    AISecKit
  • Websitegithub.com
  • Published date2025/04/28

Categories

  • AI Models
  • AI Application Platforms
  • AI Development Frameworks

Tags

    More Products

    image of Nano Bananary
    AI ModelsAI Application PlatformsAI Video Tools
    Visit Website
    icon of Nano Bananary

    Nano Bananary

    Nano Bananary is an AI batch image and video generator with 142 effects.

    Text-to-VideoGenerative AI
    image of Twocast
    AI Application PlatformsAI Productivity ToolsAI Audio Tools
    Visit Website
    icon of Twocast

    Twocast

    AI Podcast Generator for bilingual episodes, supporting multiple languages and alternative to NotebookLLM.

    Content Creation
    image of ZCF
    AI Application PlatformsAI Productivity ToolsAI Development Frameworks
    Visit Website
    icon of ZCF

    ZCF

    Zero-Config Code Flow for Claude code & Codex, enabling seamless integration and configuration for AI development.

    Open SourceClaude