A reasoning model fine-tuned from Qwen2.5-32B-Instruct on just 1,000 examples. It matches o1-preview and exhibits test-time scaling via budget forcing. Available in [F16, q8_0, q6_K, q4_K_S].
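
The description mentions budget forcing without showing it. The following is a minimal sketch of the idea, not this model's actual inference code. It assumes budget forcing means controlling how long the model "thinks": cutting the reasoning off once a token budget is spent, or appending "Wait" when the model tries to stop early so that it keeps reasoning. The `</think>` delimiter, the `generate` callable, the whitespace-based token count, and `toy_model` are all hypothetical stand-ins.

```python
from typing import Callable, List

# Hypothetical end-of-thinking marker; the real model's delimiter may differ.
END_OF_THINKING = "</think>"


def budget_force(
    generate: Callable[[str, int], str],
    prompt: str,
    max_thinking_tokens: int,
    num_waits: int = 1,
) -> str:
    """Return a thinking trace whose length is steered by a token budget.

    `generate(context, max_new_tokens)` is an assumed interface that returns
    text, possibly containing END_OF_THINKING. Tokens are approximated by
    whitespace-separated words to keep the sketch self-contained.
    """
    words: List[str] = []
    waits_left = num_waits
    while len(words) < max_thinking_tokens:
        remaining = max_thinking_tokens - len(words)
        chunk = generate(prompt + " " + " ".join(words), remaining)
        wants_to_stop = END_OF_THINKING in chunk
        # Keep only the text before the delimiter, capped at the budget.
        thinking = chunk.split(END_OF_THINKING)[0].split()
        words.extend(thinking[:remaining])
        if wants_to_stop and waits_left > 0 and len(words) < max_thinking_tokens:
            # Suppress the early stop and nudge the model to keep reasoning.
            words.append("Wait")
            waits_left -= 1
            continue
        break
    # Close the thinking phase explicitly, whether or not the model did.
    return " ".join(words) + " " + END_OF_THINKING


def toy_model(context: str, max_new_tokens: int) -> str:
    """Stand-in for a real model call; always tries to stop after five words."""
    return "two plus two is four </think>"


if __name__ == "__main__":
    print(budget_force(toy_model, "Q: what is 2 + 2?", max_thinking_tokens=20))
    print(budget_force(toy_model, "Q: what is 2 + 2?", max_thinking_tokens=3))
```

Raising `max_thinking_tokens` or `num_waits` spends more compute at inference time, which is the knob that test-time scaling varies. With a real model, `generate` would wrap an actual completion call and the budget would be counted with the model's tokenizer.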

tools

71 8 weeks ago

4 Tags
2570713ff25f • 66GB • 8 weeks ago
beec56789fb9 • 19GB • 8 weeks ago
0e29f83b6eda • 27GB • 8 weeks ago
79f0f1270a46 • 35GB • 8 weeks ago