Ollama model based on Unsloth's UD-Q2_K_XL quantization of Qwen3-235B-A22B-Instruct-2507.
246 Pulls 1 Tag Updated 5 months ago
2-bit Q2_K_XL quantized GGUF version of Qwen3-235B-A22B-Thinking-2507 (MoE, 22B active), optimized for deep reasoning with a 262K context window. Runs on Ollama with ~86.5 GiB RAM.
1,331 Pulls 1 Tag Updated 10 months ago
(c) https://huggingface.co/unsloth/Qwen3-235B-A22B-Instruct-2507-GGUF, non-thinking model
438 Pulls 1 Tag Updated 10 months ago
256 Pulls 1 Tag Updated 10 months ago
Qwen3-235B-A22B-Instruct-2507 nothink Q8
138 Pulls 1 Tag Updated 10 months ago
556 Pulls 1 Tag Updated 1 year ago
dynamic quants 2.0 from unsloth, merged
125 Pulls 1 Tag Updated 1 year ago