247 downloads · updated 7 months ago

IQ4_XS quant of Qwen/QwQ-32B

tools

17604fc0bd6b · 18GB · qwen2 · 32.8B · IQ1_M
{{- if .Suffix }}<|fim_prefix|>{{ .Prompt }}<|fim_suffix|>{{ .Suffix }}<|fim_middle|> {{- else if .M
Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US
{ "stop": [ "<|im_start|>", "<|im_end|>" ], "temperature": 0.6, "top

Readme

I added the :short, :medium, and :long tags. Each one has a system prompt designed to keep the thinking short, medium, or long. The default context length is also set to 16k.
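As a rough sketch of how one of these tags might be selected from code, the snippet below builds a request for Ollama's local `/api/chat` endpoint. The model name `qwq-32b` is a placeholder (this page does not show the published name), the URL assumes Ollama's default local port, and reading "16k" as 16384 tokens is also an assumption. Passing `num_ctx` explicitly is optional, since the tags already set a default; it is included only to show where an override would go.

```python
import json
import urllib.request

# Placeholder name (assumption): replace with the name this model is published under.
MODEL = "qwq-32b"
# Assumes a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/chat"
VALID_TAGS = ("short", "medium", "long")


def build_chat_request(prompt, tag="short", num_ctx=16384):
    """Build a non-streaming /api/chat payload for one of the thinking-length tags.

    num_ctx=16384 is an assumed reading of the README's "16k".
    """
    if tag not in VALID_TAGS:
        raise ValueError(f"tag must be one of {VALID_TAGS}, got {tag!r}")
    return {
        "model": f"{MODEL}:{tag}",
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,
        "options": {"num_ctx": num_ctx},
    }


def send(payload):
    """POST the payload to the local server and return the reply text."""
    request = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["message"]["content"]
```

With a server running, `send(build_chat_request("Why is the sky blue?", tag="long"))` would request the variant whose system prompt allows longer thinking.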

GGUF from bartowski/QwQ-32B-GGUF.

Original model from Qwen/QwQ-32B.

The parameters are from the recommended Usage Guidelines.