https://github.com/oamazonasgabriel/
-
qwen3.6-35b-a3b
A lightweight, variant of Qwen3.6-35B-A3B using Q4_K_M quantization. Modelfile Designed to fit within 24 GB total VRAM with a 16K context window.
tools thinking226 Pulls 1 Tag Updated 5 days ago
-
qwen3.5-9b
A coding-optimized configuration of Qwen3.5-9B designed for 16 GB single-GPU hardware. The model uses the official Q4_K_M quantization (~6.6 GB weights), leaving ~9 GB headroom for KV cache — enabling 32K+ context windows comfortably.
vision tools thinking4 Pulls 1 Tag Updated 4 hours ago