38 3 weeks ago

I trained a 51M parameter GPT-style model on an RTX 3080 10GB using TinyStories. It can generate coherent children’s-story style text. Includes HF Transformers weights and GGUF/Ollama export. Followed https://arxiv.org/abs/2305.07759

ollama run evanollama/tiny-stories-50m-release

Details

3 weeks ago

ad76cb1bccbe · 105MB ·

gpt2
·
51.2M
·
F16

Readme

No readme