Qihoo 360's first-generation reasoning model, Tiny-R1-32B-Preview, which outperforms the 70B model Deepseek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.

32b

42 5 weeks ago

1 Tag
92a69ba150e5 • 20GB • 5 weeks ago