Tiny-R1-32B-Preview outperforms the 70B model DeepSeek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.

tools 32b

864 pulls • Updated 4 weeks ago

6 Tags
dffc6569dea2 • 20GB • 4 weeks ago
dffc6569dea2 • 20GB • 4 weeks ago
dffc6569dea2 • 20GB • 4 weeks ago
4f1f113e2cd3 • 66GB • 4 weeks ago
dffc6569dea2 • 20GB • 4 weeks ago
3d2fadf6c7f6 • 35GB • 4 weeks ago