AI Podcast Generator for bilingual episodes, supporting multiple languages and alternative to NotebookLLM.
VibeVoice is a community-maintained fork for expressive, longform conversational speech synthesis.
Official ElevenLabs MCP server for Text to Speech and audio processing APIs.
A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
AigcPanel is an easy-to-use one-stop AI digital human system supporting video composition and voice synthesis.
智能视频处理系统,提供音频处理、字幕生成、翻译功能等多项服务。
A generative speech model for daily dialogue.
A GitHub repository for ComfyUI's ChatTTS, enabling voice synthesis and multi-person dialogue generation.
快速提取音视频内容,整理成一份结构化的markdown笔记
LiberSonora is an AI-powered open-source audiobook toolkit with features like smart subtitle extraction and multilingual translation.
A free and open source Ai-based meeting note taker that runs locally on your device, supporting Mac and Windows.
FoloUp is an AI-powered voice interviewer designed to streamline the hiring process.