A powerful framework for building realtime voice AI agents.
Oliva is a multi-agent assistant designed for various tasks like semantic search and text generation.
Fuse ChatTTS with OpenVoice to clone your personalized voice from a 10-second audio clip upload.
Real-time voice interactive digital human supporting customizable appearance and voice with low latency.
Orpheus TTS is an open-source system for human-sounding speech synthesis using Llama-3b backbone.
TTSFM is a reverse-engineered API server mirroring OpenAI's TTS service for text-to-speech conversion.
Demo app for Groq plugins in LiveKit Agents.
A simple voice generation tool that converts text to natural speech using the CosyVoice2 model.
Transforms research papers into engaging three-person podcast discussions for a fresh listening experience.
An AI chatbot integrating Dify and Coze platforms for WeChat with UI configuration and memory capabilities.
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.