25 3 months ago

The Larger Raven, built on the corvid intelligence principle: 3B parameters with primate-level cognition. Architecture over size, an efficiency revolution. Feel the speed!

0bc632aa169a · 165B
{
  "num_batch": 512,
  "num_ctx": 8192,
  "num_gpu": 99,
  "repeat_penalty": 1.1,
  "stop": [
    "<|im_end|>",
    "<|im_start|>"
  ],
  "temperature": 0.7,
  "top_k": 40,
  "top_p": 0.9
}
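
As a rough illustration of how these defaults could be used from a client, the sketch below parses the parameter block and places it under the `options` key of a request body for Ollama's `/api/generate` endpoint. The model tag `larger-raven`, the prompt, and the helper name `build_generate_payload` are assumptions made for the example; the listing above does not state the actual tag. The request is only built and printed, not sent.

```python
import json

# Parameters copied from the listing above.
PARAMS_JSON = """
{
  "num_batch": 512,
  "num_ctx": 8192,
  "num_gpu": 99,
  "repeat_penalty": 1.1,
  "stop": ["<|im_end|>", "<|im_start|>"],
  "temperature": 0.7,
  "top_k": 40,
  "top_p": 0.9
}
"""


def build_generate_payload(model: str, prompt: str, params: dict) -> dict:
    """Build a request body for POST /api/generate (hypothetical helper)."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,          # ask for a single JSON response
        "options": dict(params),  # per-request overrides of model defaults
    }


params = json.loads(PARAMS_JSON)

# "larger-raven" is a placeholder tag, not taken from the listing.
payload = build_generate_payload("larger-raven", "Why are corvids clever?", params)
print(json.dumps(payload, indent=2))
```

With a local Ollama server running, a body like this would typically be posted to `http://localhost:11434/api/generate`. Since the values above are already the model's stored defaults, passing them explicitly mainly matters when overriding one of them, for example lowering `temperature` for more deterministic output.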