17 1 week ago

A GPT-style language model built entirely from scratch (no HuggingFace/nanoGPT for the model itself) and trained on 50 children's moral stories. It's an educational reproduction of the full LLM lifecycle at miniature scale.

9066c9295822 · 101B
{
"num_ctx": 128,
"stop": [
"<|endoftext|>",
"<|user|>"
],
"temperature": 0.7,
"top_k": 40
}