164 7 months ago

tools
ollama run robbiemu/qwen3-coder:30b-a3b-i-q4_K_XL

Applications

Claude Code
Claude Code ollama launch claude --model robbiemu/qwen3-coder:30b-a3b-i-q4_K_XL
Codex
Codex ollama launch codex --model robbiemu/qwen3-coder:30b-a3b-i-q4_K_XL
OpenCode
OpenCode ollama launch opencode --model robbiemu/qwen3-coder:30b-a3b-i-q4_K_XL
OpenClaw
OpenClaw ollama launch openclaw --model robbiemu/qwen3-coder:30b-a3b-i-q4_K_XL

Models

View all →

Readme

This is unsloth’s popular quantization (https://huggingface.co/unsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF/blob/main/Qwen3-Coder-30B-A3B-Instruct-UD-Q4_K_XL.gguf) with tool calling based on the discussion: https://github.com/ggml-org/llama.cpp/issues/15012

Apologies about the num_ctx being lower than the model’s actual supported maximum.