Tag
Explore by tags

llmdocparser
A package for parsing PDFs and analyzing their content using LLMs.

whisper
Robust Speech Recognition via Large-Scale Weak Supervision

Mobile-Agent
Mobile-Agent is a family of mobile device operation assistants designed for complex task automation.

DeepChat
DeepChat is a feature-rich open-source AI chat platform supporting multiple cloud and local large language models.

Witsy
Witsy is a desktop AI assistant designed to enhance productivity across various applications.

Make-An-Audio
PyTorch implementation of a generative model for high-fidelity audio generation from text prompts.

auto-video-generateor
An automatic video generator: given a topic, it generates a narrated explainer video.

MarkPDFdown
A high-quality PDF to Markdown tool based on large language model visual recognition.

EuroBERT
EuroBERT is a multilingual encoder model designed for European languages, trained using the Optimus training library.

ChatTTS-OpenVoice
Fuses ChatTTS with OpenVoice to clone a personalized voice from an uploaded 10-second audio clip.

Seamless Communication
Foundational Models for State-of-the-Art Speech and Text Translation.