38 3 weeks ago

I trained a 51M parameter GPT-style model on an RTX 3080 10GB using TinyStories. It can generate coherent children’s-story style text. Includes HF Transformers weights and GGUF/Ollama export. Followed https://arxiv.org/abs/2305.07759

ollama run evanollama/tiny-stories-50m-release

Models

View all →

Readme

No readme