35 3 weeks ago

Jan-v3 is a compact 4B-parameter model that leverages distillation from a larger teacher to maintain strong general performance and broad applicability while avoiding typical capacity limitations.

tools