A Conversational Speech Generation Model that generates audio codes from text and audio inputs.
Speech to Text but with all the bells and whistles and most importantly AI!
The python library for real-time communication.
A local AI-powered tool that converts PDF documents into engaging audio using local LLMs and TTS models.
A sound cloning tool with a web interface, using your voice or any sound to record audio.
Robust Speech Recognition via Large-Scale Weak Supervision
Build your own AI friend using ESP32 and various AI technologies.
A local alternative to Manus AI that operates autonomously without cloud dependency or high costs.
Coze-on-Wechat is a personal smart assistant for WeChat, integrating most functionalities of Coze Bot.
自动视频生成器,给定主题,自动生成解说视频。