25 2 weeks ago

Ring-flash-2.0 has a total of 100B parameters, with only 6.1B activated per inference.

100b
77c25c0df517 · 35B
{
"num_ctx": 4096,
"temperature": 0.7
}