17 1 week ago

A GPT-style language model built entirely from scratch (no HuggingFace/nanoGPT for the model itself) and trained on 50 children's moral stories. It's an educational reproduction of the full LLM lifecycle at miniature scale.

ollama run sppandita85/story-llm

Details

1 week ago

c7adf62d748c · 2.0MB ·

gpt2
·
941K
·
F16
story-llm: a 0.94M-parameter GPT built from scratch and trained on 50 children's moral stories. An e
{ "num_ctx": 128, "stop": [ "<|endoftext|>", "<|user|>" ], "temperat
<|user|> {{ .Prompt }} <|assistant|>

Readme

No readme