547 Downloads Updated 5 months ago
ollama run ucx0204/glm-4.6V-Flash-Q8
Updated 5 months ago
5 months ago
1e43920e24a6 · 12GB ·
This is a GGUF version of the GLM-4.6V-Flash model, quantized to Q8_0 (8-bit) for high-quality inference. It originates from Zhipu AI and was converted/quantized by Unsloth.