s1 is a reasoning model finetuned from Qwen2.5-32B-Instruct on just 1,000 examples. It matches o1-preview & exhibits test-time scaling via budget forcing.
tools
525 Pulls Updated 2 months ago
Updated 2 months ago
2 months ago
f03012c03b00 · 10GB
model
archqwen2
·
parameters32.8B
·
quantizationIQ4_XS
10GB
template
{{- if .Suffix }}<|fim_prefix|>{{ .Prompt }}<|fim_suffix|>{{ .Suffix }}<|fim_middle|>
{{- else if .M
1.6kB