Ollama model based on Unsloth's UD-Q2_K_XL quantization of Qwen3-235B-A22B-Instruct-2507.
236 Pulls 1 Tag Updated 5 months ago
2-bit Q2_K_XL quantized GGUF version of Qwen3-235B-A22B-Thinking-2507 (MoE, 22B active), optimized for deep reasoning with a 262K context window. Runs on Ollama with ~86.5 GiB RAM.
1,316 Pulls 1 Tag Updated 10 months ago
(c) https://huggingface.co/unsloth/Qwen3-235B-A22B-Instruct-2507-GGUF, non-thinking model
435 Pulls 1 Tag Updated 10 months ago
253 Pulls 1 Tag Updated 10 months ago
Qwen3-235B-A22B-Instruct-2507 nothink Q8
138 Pulls 1 Tag Updated 10 months ago
554 Pulls 1 Tag Updated 1 year ago