Netflix-level subtitle cutting, translation, alignment, and dubbing - one-click fully automated AI video subtitle team.
A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion, etc.
E2M converts various file types into Markdown, offering easy installation and a flexible, open-source solution.
Convert videos into high-quality Xiaohongshu notes in one click, with automatically optimized content and images.
AigcPanel is an easy-to-use one-stop AI digital human system supporting video composition and voice synthesis.
A GitHub repository for ComfyUI's ChatTTS, enabling voice synthesis and multi-person dialogue generation.
Quickly extract audio and video content and organize it into a structured Markdown note.
A tool for easy and efficient video subtitling, supporting speech recognition, translation, and subtitle optimization.
LiberSonora is an AI-powered open-source audiobook toolkit with features like smart subtitle extraction and multilingual translation.
Step-Audio is an open-source framework for intelligent speech interaction, supporting multilingual and emotional speech synthesis.
A demo for recording audio and video streams with simultaneous speech and face recognition.
Faster Whisper transcription with CTranslate2.