248 1 year ago

IQ4_XS quant of Qwen/QwQ-32B

tools
ollama run vanilj/qwq-32b-iq4_xs

Applications

Claude Code
Claude Code ollama launch claude --model vanilj/qwq-32b-iq4_xs
Codex
Codex ollama launch codex --model vanilj/qwq-32b-iq4_xs
OpenCode
OpenCode ollama launch opencode --model vanilj/qwq-32b-iq4_xs
OpenClaw
OpenClaw ollama launch openclaw --model vanilj/qwq-32b-iq4_xs

Models

View all →

Readme

I added the :short, medium, and long tags, each one have a system prompt designed to keep the thinking short, medium or long. It also sets the default context length to 16k.

GGUF from bartowski/QwQ-32B-GGUF.

Original model from Qwen/QwQ-32B.

The parameters are from the recommended Usage Guidelines.