VibeVoice is a community-maintained fork for expressive, long-form conversational speech synthesis.
A comprehensive guide for fine-tuning and deploying open-source LLMs in Linux environments, tailored for beginners in China.
A large language model project focused on social etiquette, with tutorials covering prompt engineering, RAG, agent applications, and LLM fine-tuning.
A guide to building your own end-to-end AI coding assistant, similar to GitHub Copilot.
A curated collection of open-source Chinese large language models, focusing on smaller, privatizable, and cost-effective models.
Open-source Chinese LLaMA and Alpaca models for local CPU/GPU training and deployment.
DistillFlow is an open-source toolkit for distilling large language models into smaller, efficient models.
Streamline the fine-tuning process for multimodal models like PaliGemma 2, Florence-2, and Qwen2.5-VL.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs with zero-code CLI and Web UI.
CogView4 and its variants are text-to-image generation models from THUDM, focused on improving image generation quality.
Generative model training pipeline: tokenizer training, model initialization, model pre-training, and instruction fine-tuning. llama, creek
MAP-NEO is a fully open-sourced Large Language Model with state-of-the-art performance for diverse research applications.