🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.

135m 360m 1.7b

82.6K 2 months ago

ca7a9654b546 · 89B
{
"stop": [
"<|im_start|>",
"<|im_end|>"
],
"temperature": 0.2,
"top_p": 0.9
}