164 Downloads Updated 7 months ago
ollama run robbiemu/qwen3-coder:30b-a3b-i-q4_K_XL
Updated 7 months ago
7 months ago
b37ca1b42110 · 18GB ·
This is unsloth’s popular quantization (https://huggingface.co/unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF/blob/main/Qwen3-Coder-30B-A3B-Instruct-UD-Q4_K_XL.gguf) with tool calling based on the discussion: https://github.com/ggml-org/llama.cpp/issues/15012
Apologies about the num_ctx being lower than the model’s actual supported maximum.