2 Downloads Updated 5 days ago
Qwen3-4B-Reasoning is a GGUF conversion of joeyzero/Qwen3-4B-Reasoning-Backfill-v0.1 for llama.cpp / Ollama.
Upstream: https://huggingface.co/joeyzero/Qwen3-4B-Reasoning-Backfill-v0.1
Alias matches existing local artifacts; adjust if needed.
ChatMLqwen34.0B40960apache-2.0top_k=20, top_p=0.949999988079071, temp=0.6000000238418579| Tag | GGUF | Size | RAM (est.) | Notes |
|---|---|---|---|---|
IQ4_XS |
Qwen3-4B-Reasoning-IQ4_XS.gguf |
2.13 GiB | 4 GiB | |
Q4_K_M |
Qwen3-4B-Reasoning-Q4_K_M.gguf |
2.33 GiB | 4 GiB | Recommended |
ollama run richardyoung/qwen3-4b-reasoning:q4_k_m "Hello!"
ollama run richardyoung/qwen3-4b-reasoning:iq4_xsollama run richardyoung/qwen3-4b-reasoning:q4_k_mSee the upstream repo for license/terms: https://huggingface.co/joeyzero/Qwen3-4B-Reasoning-Backfill-v0.1
llama-quantize).convert_hf_to_gguf.py).