AILO-152M-v3 Tiny LLM improved via true logit distillation ⚡

tools thinking 152m
3a2ace961965 · 193B
{
  "num_ctx": 512,
  "num_predict": 256,
  "repeat_penalty": 1.3,
  "stop": [
    "<|end|>",
    "</|end|>",
    "<|user|>",
    "<|assistant|>"
  ],
  "temperature": 0.3,
  "top_k": 20,
  "top_p": 0.9
}
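
These parameters can also be sent per request instead of relying on the stored defaults. The sketch below assumes the model is served by a local Ollama instance at its default address and builds a request for the `/api/generate` endpoint using only the Python standard library. The model tag `ailo-152m-v3` and the helper names `build_request` and `generate` are placeholders for illustration, not names taken from this page.

```python
import json
import urllib.request

# Sampling parameters copied from the params file above.
OPTIONS = {
    "num_ctx": 512,          # context window, in tokens
    "num_predict": 256,      # maximum number of tokens to generate
    "repeat_penalty": 1.3,   # penalty applied to repeated tokens
    "stop": ["<|end|>", "</|end|>", "<|user|>", "<|assistant|>"],
    "temperature": 0.3,
    "top_k": 20,
    "top_p": 0.9,
}

# Assumptions: both values are placeholders. Substitute the tag the
# model was actually pulled under and the address of your server.
MODEL_TAG = "ailo-152m-v3"
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str) -> urllib.request.Request:
    """Build (but do not send) a non-streaming generate request."""
    body = {
        "model": MODEL_TAG,
        "prompt": prompt,
        "stream": False,
        "options": OPTIONS,
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str) -> str:
    """Send the request and return the generated text.

    Requires a running server that has the model available.
    """
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Hello")` sends the prompt to the server. With `num_ctx` at 512 and `num_predict` at 256, the prompt and the reply share a small window, so long prompts leave little room for output.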