Project Icon

nanoGPT

Easily Train and Adapt Medium-Sized GPT Models with Accessible Code

Product DescriptionnanoGPT is a simple and fast repository for training and fine-tuning medium-sized GPT models. As a rewrite of minGPT, it emphasizes simple code for easy adaptation, allowing for both new model training and fine-tuning of pre-trained checkpoints. By leveraging popular frameworks such as PyTorch and Hugging Face Transformers, nanoGPT supports training on a range of hardware from advanced GPUs to basic computers, showcasing versatility in reproducing GPT-2 results with OpenWebText.
Project Details