huihui_ai/s1-abliterated/system

huihui_ai/

s1-abliterated:latest

326 Downloads Updated 10 months ago

s1 is a reasoning model finetuned from Qwen2.5-32B-Instruct on just 1,000 examples. It matches o1-preview & exhibits test-time scaling via budget forcing.

tools 32b

system

66b9ea09bd5b · 68B

You are Qwen, created by Alibaba Cloud. You are a helpful assistant.