Models
GitHub
Discord
Docs
Cloud
Sign in
Download
Models
Download
GitHub
Discord
Docs
Cloud
Sign in
richardyoung
/
qwen3-4b-reasoning
2
Downloads
Updated
5 days ago
qwen3-4b-reasoning is a 4B-parameter Qwen3-based reasoning “backfill” fine-tune (joeyzero/Qwen3-4B-Reasoning-Backfill-v0.1) converted to GGUF for llama.cpp/Ollama, with ~40K context and published as Q4_K_M (recommended) and iq4_xs (smaller).
qwen3-4b-reasoning is a 4B-parameter Qwen3-based reasoning “backfill” fine-tune (joeyzero/Qwen3-4B-Reasoning-Backfill-v0.1) converted to GGUF for llama.cpp/Ollama, with ~40K context and published as Q4_K_M (recommended) and iq4_xs (smaller).
Cancel
Name
2 models
Size
Context
Input
qwen3-4b-reasoning:Q4_K_M
cf2fe7713822
• 2.5GB • 40K context window •
Text input • 5 days ago
Text input • 5 days ago
qwen3-4b-reasoning:Q4_K_M
2.5GB
40K
Text
cf2fe7713822
· 5 days ago
qwen3-4b-reasoning:iq4_xs
c22e68adeef9
• 2.3GB • 40K context window •
Text input • 5 days ago
Text input • 5 days ago
qwen3-4b-reasoning:iq4_xs
2.3GB
40K
Text
c22e68adeef9
· 5 days ago