Step-Audio is an open-source framework for intelligent speech interaction, supporting multilingual and emotional speech synthesis.

Nano Bananary is an AI batch image and video generator with 142 effects.

AI Podcast Generator creates bilingual podcast episodes in multiple languages and serves as an alternative to NotebookLM.

Zero-Config Code Flow for Claude Code and Codex, enabling seamless integration and setup for AI development.
Step-Audio is the first production-ready open-source framework for intelligent speech interaction that harmonizes comprehension and generation. It supports multilingual conversations (e.g., Chinese, English, Japanese), emotional tones (e.g., joy/sadness), regional dialects (e.g., Cantonese/Sichuanese), adjustable speech rates, and prosodic styles (e.g., rap).