AnimeGamer is an infinite anime life simulation tool that predicts game states using multimodal models.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Official repository for generating 360-degree human head views from single portrait images using diffusion techniques.
SOTA Open Source TTS for high-quality text-to-speech synthesis with multilingual support.
A flexible framework for optimizing local deployments of large language models with cutting-edge inference techniques.
A one-stop solution for creating digital avatars from WeChat chat records using fine-tuned large language models.
Speech-AI-Forge is a project centered on TTS generation, offering an API Server and a Gradio-based WebUI.
Notes and exploration code for learning about AI/ML.
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams).
Model Activity Visualiser for visualizing the internal workings of Large Language Models as they generate text.
Efficient full parameter tuning library for reinforcement learning applications in LLMs.
CodeScientist is an automated scientific discovery system for code-based experiments using LLMs.