Models
GitHub
Discord
Docs
Cloud
Sign in
Download
Models
Download
GitHub
Discord
Docs
Cloud
Sign in
ExpedientFalcon
/
Qwen3-4B-UD-Q5_K_XL
:latest
91
Downloads
Updated
5 months ago
Qwen3-4B Q5_K_XL Unsloth UD 2.0 adaptively quantized model, much better for coding than vanilla Q4_K_M quants without taking up the VWAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main
Qwen3-4B Q5_K_XL Unsloth UD 2.0 adaptively quantized model, much better for coding than vanilla Q4_K_M quants without taking up the VWAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main
Cancel
tools
Updated 5 months ago
5 months ago
f5bc81c7b7e3 · 2.9GB ·
model
arch
qwen3
·
parameters
4.02B
·
quantization
Q5_K_M
2.9GB
params
{ "min_p": 0, "num_ctx": 40960, "num_predict": 16384, "repeat_penalty": 1, "stop
166B
template
{{- if .Messages }} {{- if or .System .Tools }}<|im_start|>system {{- if .System }} {{ .System }} {{
1.5kB
Readme
No readme
Write
Preview
Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)