A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
AigcPanel is an easy-to-use one-stop AI digital human system supporting video composition and voice synthesis.
智能视频处理系统,提供音频处理、字幕生成、翻译功能等多项服务。
A generative speech model for daily dialogue.
A GitHub repository for ComfyUI's ChatTTS, enabling voice synthesis and multi-person dialogue generation.
快速提取音视频内容,整理成一份结构化的markdown笔记
LiberSonora is an AI-powered open-source audiobook toolkit with features like smart subtitle extraction and multilingual translation.
A free and open source Ai-based meeting note taker that runs locally on your device, supporting Mac and Windows.
FoloUp is an AI-powered voice interviewer designed to streamline the hiring process.
Transform PDFs into AI podcasts for engaging on-the-go audio content.
一个基于 AI 的 Hacker News 中文播客项目,每天自动抓取 Hacker News 热门文章,通过 AI 生成中文总结并转换为播客内容。
Step-Audio is an open-source framework for intelligent speech interaction, supporting multilingual and emotional speech synthesis.