LogoAISecKit
  • Search
  • Collection
  • Category
  • Tag
  • Blog
  • Pricing
  • Submit
LogoAISecKit

Newsletter

Join the Community

Subscribe to our newsletter for the latest news and updates

LogoAISecKit

Curated directory of 1700+ AI tools, models, frameworks, MCP servers, and cybersecurity resources

GitHub
Product
  • Search
  • Collection
  • Category
  • Tag
Resources
  • Blog
  • Pricing
  • Submit
Company
  • About Us
  • Privacy Policy
  • Terms of Service
  • Sitemap
Copyright © 2026 All Rights Reserved.
Sponsored Resources
  1. Home
  2. Category
  3. MiniMind

MiniMind

MiniMind is an open-source project to train a 26M-parameter GPT model from scratch in just 2 hours.

Visit Website
Visit Website

Introduction

MiniMind: Train a 26M-Parameter GPT from Scratch in Just 2 Hours!

MiniMind is an open-source project aimed at lowering the barrier to entry for training large language models (LLMs). With just a minimal cost of around 3 RMB and a training time of 2 hours on a single NVIDIA 3090 GPU, users can train a lightweight 26M-parameter GPT model from scratch.

Key Features:
  • Lightweight Model: The smallest version of MiniMind is only 25.8M parameters, making it accessible for personal GPUs.
  • Comprehensive Training Process: The project includes detailed code for pre-training, supervised fine-tuning (SFT), reinforcement learning from human feedback (RLHF), and model distillation.
  • Open Source: All core algorithms are implemented from scratch using PyTorch, without relying on third-party libraries.
  • Multi-Modal Capabilities: MiniMind has been extended to support visual multi-modal tasks with MiniMind-V.
  • User-Friendly: The project serves as a tutorial for beginners in LLMs, providing a hands-on experience in training and understanding the underlying mechanisms of large models.
Benefits:
  • Cost-Effective: Users can experience the entire process of building a language model for less than 3 RMB.
  • Educational Resource: Ideal for those looking to learn about LLMs and their training processes.
  • Community Driven: Encourages contributions and improvements from the community, fostering a collaborative environment for AI development.
Highlights:
  • Supports single and multi-GPU training.
  • Compatible with popular frameworks like transformers and trl.
  • Provides a simple API for integration with third-party applications.

Join the MiniMind community and start your journey in AI model training today!

Back

Information

  • Publisher
    AISecKit
  • Websitegithub.com
  • Published date2025/04/28

Categories

  • AI Models
  • AI Research Papers
  • AI Development Frameworks

Tags

  • Reinforcement Learning
  • Open Source
  • GPT Models

More Products

image of Nano Bananary
AI ModelsAI Application PlatformsAI Video Tools
Visit Website
icon of Nano Bananary

Nano Bananary

Nano Bananary is an AI batch image and video generator with 142 effects.

Text-to-VideoGenerative AI
image of ZCF
AI Application PlatformsAI Productivity ToolsAI Development Frameworks
Visit Website
icon of ZCF

ZCF

Zero-Config Code Flow for Claude code & Codex, enabling seamless integration and configuration for AI development.

Open SourceClaude
image of Awesome Public Datasets
AI ModelsAI Application PlatformsAI Productivity Tools
Visit Website
icon of Awesome Public Datasets

Awesome Public Datasets

A topic-centric list of HQ open datasets for various fields and applications.