A curated collection of open-source Chinese large language models, focusing on smaller, privatizable, and cost-effective models.
Open-source Chinese LLaMA and Alpaca models for local CPU/GPU training and deployment.
DeepSeek-V3 is an advanced Mixture-of-Experts language model with innovative inference capabilities and efficient training methods.
DeepSeek-VL2 is a series of advanced Mixture-of-Experts Vision-Language Models for multimodal understanding.
A GitHub repository exploring the intersection of UAVs and Large Language Models.
HeadInfer is a memory-efficient inference framework for large language models that reduces GPU memory consumption.
A knowledge-sharing platform about large language models for job interviews and general understanding.
Wan2.1 is an open suite of advanced video generative models, enabling innovative video creation and editing.
A Unified Tokenizer for Visual Generation and Understanding.
MAP-NEO is a fully open-sourced Large Language Model with state-of-the-art performance for diverse research applications.
Notes for software engineers getting up to speed on new AI developments, including resources and frameworks.
A comprehensive survey on benchmarks for Multimodal Large Language Models (MLLMs).