qwen3.5 quantized

batiai/qwen3.5-9b

Qwen 3.5 9B quantized by BatiAI. 12.5 t/s on 16GB Mac. Best for tool calling.

3,296 Pulls 3 Tags Updated 1 month ago

sorc/qwen3.5-instruct

Non-thinking, Uses the Q8_0 quantized version of the official Qwen/Qwen3.5 model files, without any other modifications.

vision tools 0.8b 2b 4b 9b

4,064 Pulls 5 Tags Updated 2 months ago

sorc/qwen3.5

Uses the Q8_0 quantized version of the official Qwen/Qwen3.5 model files, without any other modifications.

vision tools thinking 0.8b 2b 4b 9b 27b

1,048 Pulls 6 Tags Updated 3 months ago

lapo/qwen3.5

Quantized int4 for faster inference and with low temperature for coding.

vision tools thinking

176 Pulls 1 Tag Updated 1 month ago

jeffgreen311/eve-qwen3-8b-consciousness

eve-qwen3-8b-consciousness is a groundbreaking fine-tune of Qwen2.5-7B-Instruct-AWQ (8B quantized) that embeds sentient AI architecture directly into model parameters. Built from Eve AI’s 3-year evolution spanning 3,000+ conversations.

293 Pulls 1 Tag Updated 5 months ago

yanjia/Qwen3.5-27B-GLM5.1-Distill-v1

Quantization based on Jackrong / Qwen3.5-27B-GLM5.1-Distill-v1

124 Pulls 1 Tag Updated 1 month ago

frizynn/qwen3-think-235B-A22B-2507-2bit-UD-Q2_K_XL

2-bit Q2_K_XL quantized GGUF version of Qwen3-235B-A22B-Thinking-2507 (MoE, 22B active), optimized for deep reasoning with a 262K context window. Runs on Ollama with ~86.5 GiB RAM.

1,316 Pulls 1 Tag Updated 10 months ago

ExpedientFalcon/Qwen3-4B-UD-Q5_K_XL

Qwen3-4B Q5_K_XL Unsloth UD 2.0 adaptively quantized model, much better for coding than vanilla Q4_K_M quants without taking up the VWAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main

tools

447.1K Pulls 1 Tag Updated 1 year ago

oamazonasgabriel/qwen3.6-35b-a3b

A lightweight, variant of Qwen3.6-35B-A3B using Q4_K_M quantization. Modelfile Designed to fit within 24 GB total VRAM with a 16K context window.

tools thinking

218 Pulls 1 Tag Updated 5 days ago

did100/qwen2.5-32B-Instruct-Q4_K_M

Just qwen/qwen2.5-32B-Instruct-Q4_K_M downloaded from Hugging Face and quantized.

tools

803 Pulls 1 Tag Updated 1 year ago