
Animation testing based on Bert-VITS2 for generating facial expressions and body animations from audio input.

A tool to convert videos and audios into various document styles like notes and mind maps.

An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC for VTubing and virtual assistant applications.

OpenUtau is a free and open-source singing synthesis platform, serving as a successor to UTAU.

A TTS and STS library built on Apple's MLX framework for efficient speech synthesis on Apple Silicon.

Targeted Adversarial Examples on Speech-to-Text systems.