
VibeVoice is a community-maintained fork for expressive, long-form conversational speech synthesis.

PyTorch implementation of a generative model that produces high-fidelity audio from text prompts.

Orpheus TTS is an open-source system for human-sounding speech synthesis, built on a Llama-3b backbone.

TTSFM is a reverse-engineered API server that mirrors OpenAI's text-to-speech (TTS) service.