HeadInfer is an inference framework for large language models that reduces GPU memory consumption.
A knowledge-sharing platform on large language models, aimed at job-interview preparation and general understanding.
Wan2.1 is an open suite of advanced video generative models, enabling innovative video creation and editing.
A unified tokenizer for visual generation and understanding.
MAP-NEO is a fully open-source large language model with state-of-the-art performance for diverse research applications.
Notes for software engineers getting up to speed on new AI developments, including resources and frameworks.
A comprehensive survey on benchmarks for Multimodal Large Language Models (MLLMs).