A package for parsing PDFs and analyzing their content using LLMs.
Robust Speech Recognition via Large-Scale Weak Supervision
Mobile-Agent is a powerful mobile device operation assistant family designed for complex task automation.
DeepChat is a feature-rich open-source AI chat platform supporting multiple cloud and local large language models.
Witsy is a desktop AI assistant designed to enhance productivity across various applications.
PyTorch implementation of a generative model for high-fidelity audio generation from text prompts.
自动视频生成器,给定主题,自动生成解说视频。
A high-quality PDF to Markdown tool based on large language model visual recognition.
EuroBERT is a multilingual encoder model designed for European languages, trained using the Optimus training library.
Fuse ChatTTS with OpenVoice to clone your personalized voice from a 10-second audio clip upload.
Foundational Models for State-of-the-Art Speech and Text Translation.