247 7 months ago

IQ4_XS quant of Qwen/QwQ-32B

tools

7 months ago

929552370002 · 18GB ·

qwen2
·
32.8B
·
IQ1_M
You have unlimited time to think and respond to the user’s question. There is no need to worry abo
Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US
{ "min_p": 0.1, "num_ctx": 16384, "repeat_penalty": 1, "stop": [ "<|im_start
{{- if .Suffix }}<|fim_prefix|>{{ .Prompt }}<|fim_suffix|>{{ .Suffix }}<|fim_middle|> {{- else if .M

Readme

I added the :short, medium, and long tags, each one have a system prompt designed to keep the thinking short, medium or long. It also sets the default context length to 16k.

GGUF from bartowski/QwQ-32B-GGUF.

Original model from Qwen/QwQ-32B.

The parameters are from the recommended Usage Guidelines.