Category
Explore by categories

AI Application PlatformsAI Video ToolsAI Audio Tools
RecordVideoWu
Details
A demo for recording audio and video streams with simultaneous speech and face recognition.

AI ModelsAI Application PlatformsAI Audio Tools
Faster Whisper
Details
Faster Whisper transcription with CTranslate2.

AI ModelsAI Application PlatformsAI Audio Tools
CSM
Details
A Conversational Speech Generation Model that generates audio codes from text and audio inputs.

AI Productivity ToolsAI Audio Tools
WhisperChain
Details
Speech to Text but with all the bells and whistles and most importantly AI!

AI ModelsAI Application PlatformsAI Audio Tools
Spark-TTS
Details
Spark-TTS is an advanced text-to-speech system using large language models for natural-sounding voice synthesis.

AI Application PlatformsAI Video ToolsAI Audio Tools
FastRTC
Details
The python library for real-time communication.

AI Application PlatformsAI Productivity ToolsAI Audio Tools
Local-NotebookLM
Details
A local AI-powered tool that converts PDF documents into engaging audio using local LLMs and TTS models.

AI Application PlatformsAI Productivity ToolsAI Audio Tools
clone-voice
Details
A sound cloning tool with a web interface, using your voice or any sound to record audio.

