-
qwen3.5-opus-4.7-distilled
183 Pulls 1 Tag Updated 3 weeks ago
-
tiny-stories-50m-release
I trained a 51M parameter GPT-style model on an RTX 3080 10GB using TinyStories. It can generate coherent children’s-story style text. Includes HF Transformers weights and GGUF/Ollama export. Followed https://arxiv.org/abs/2305.07759
38 Pulls 1 Tag Updated 3 weeks ago