CSM

A Conversational Speech Generation Model that generates audio codes from text and audio inputs.


Introduction

CSM (Conversational Speech Model)

CSM is a speech generation model developed by SesameAILabs. It generates RVQ (residual vector quantization) audio codes from text and audio inputs. The architecture pairs a Llama backbone with a specialized audio decoder that produces Mimi audio codes.
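To make the term "RVQ audio codes" concrete, here is a minimal sketch of residual vector quantization in general. It is a toy illustration, not CSM's or Mimi's actual codec: the two codebooks, the 2-dimensional vectors, and the function names (`nearest`, `rvq_encode`, `rvq_decode`) are all made up for this example. Each stage quantizes whatever error the previous stage left behind, so a frame ends up represented by one integer code per codebook.

```python
from typing import List, Sequence

Codebook = List[List[float]]


def nearest(codebook: Codebook, v: Sequence[float]) -> int:
    """Return the index of the codebook entry closest to v (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(codebook[i], v)),
    )


def rvq_encode(v: Sequence[float], codebooks: List[Codebook]) -> List[int]:
    """Quantize v stage by stage; each stage encodes the residual left by the previous one."""
    residual = list(v)
    codes = []
    for cb in codebooks:
        idx = nearest(cb, residual)
        codes.append(idx)
        # Subtract the chosen entry so the next stage only sees the remaining error.
        residual = [r - c for r, c in zip(residual, cb[idx])]
    return codes


def rvq_decode(codes: List[int], codebooks: List[Codebook]) -> List[float]:
    """Reconstruct the vector by summing the selected entry from every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, cb in zip(codes, codebooks):
        out = [o + c for o, c in zip(out, cb[idx])]
    return out


# Hypothetical toy codebooks: a coarse stage followed by a fine correction stage.
COARSE = [[0.0, 0.0], [1.0, 1.0], [-1.0, 1.0]]
FINE = [[0.0, 0.0], [0.25, 0.0], [0.0, -0.25]]

codes = rvq_encode([1.2, 1.0], [COARSE, FINE])
approx = rvq_decode(codes, [COARSE, FINE])
print(codes, approx)
```

In a real neural codec the codebooks are learned and there are many more stages and dimensions; the sketch only shows why the output of such a model is a short list of integer codes per audio frame rather than a waveform.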

Key Features:
  • Audio Generation: Generates high-quality audio from text prompts.
  • Contextual Understanding: Capable of generating audio with context for more natural conversations.
  • Open Source: Available for research and educational purposes under the Apache-2.0 license.
  • Multi-Platform Support: Compatible with various operating systems, including Windows and Linux.
Benefits:
  • Research and Development: Ideal for researchers looking to explore conversational AI and speech synthesis.
  • Interactive Demos: A fine-tuned variant of CSM powers interactive voice demos.
  • Community Contributions: Encourages contributions and collaboration through GitHub.
Highlights:
  • Latest Release: The 1B CSM variant was released on March 13, 2025, with checkpoints hosted on Hugging Face.
  • Ethical Use: Strong emphasis on responsible and ethical applications of the technology, prohibiting misuse.

Information

  • Publisher: AISecKit
  • Website: github.com
  • Published date: 2025/04/28

Categories

  • AI Models
  • AI Application Platforms
  • AI Audio Tools

Tags

  • Llama Models
  • Open Source
  • Voice Assistants
