101 Downloads Updated 3 weeks ago
This is unsloth’s popular quantization (https://huggingface.co/unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF/blob/main/Qwen3-Coder-30B-A3B-Instruct-UD-Q4_K_XL.gguf) with tool calling based on the discussion: https://github.com/ggml-org/llama.cpp/issues/15012
Apologies about the num_ctx being lower than the model’s actual supported maximum.