Ring-flash-2.0 has a total of 100B parameters, with only 6.1B activated per inference.
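As a rough illustration of what those two figures imply, the short sketch below computes the share of parameters that is active. It uses only the 100B and 6.1B values quoted above; the constant and function names are hypothetical and chosen for this example, not taken from any Ring-flash-2.0 tooling.

```python
# Figures quoted in the description above.
TOTAL_PARAMS = 100e9    # 100B total parameters
ACTIVE_PARAMS = 6.1e9   # 6.1B parameters activated per inference


def active_fraction(active: float, total: float) -> float:
    """Return the share of parameters that is activated (hypothetical helper)."""
    if total <= 0:
        raise ValueError("total must be positive")
    return active / total


fraction = active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS)
print(f"Active share: {fraction:.1%}")  # prints "Active share: 6.1%"
```

In other words, roughly one parameter in sixteen takes part in a given inference, assuming the quoted counts are exact.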
