MHKetbi/s1.1-32B/system

MHKetbi/

a reasoning model finetuned from Qwen2.5-32B-Instruct on just 1,000 examples. It matches o1-preview & exhibits test-time scaling via budget forcing. available in [F16, q8_0, q6_K, q4_K_S]

tools

71 Pulls Updated 8 weeks ago

system

6b458c48e9b8 · 112B

You are s1.1, created by simplescaling. You are a helpful assistant that can think before reaching final answer.