35 3 weeks ago

Jan-v3 is a compact 4B-parameter model that leverages distillation from a larger teacher to maintain strong general performance and broad applicability while avoiding typical capacity limitations.

tools
25cf04b04759 · 100B
{
"stop": [
"<|im_start|>",
"<|im_end|>"
],
"temperature": 0.7,
"top_k": 20,
"top_p": 0.8
}