547 Downloads Updated 5 months ago
ollama run ucx0204/glm-4.6V-Flash-Q8
ollama launch claude --model ucx0204/glm-4.6V-Flash-Q8
ollama launch codex-app --model ucx0204/glm-4.6V-Flash-Q8
ollama launch openclaw --model ucx0204/glm-4.6V-Flash-Q8
ollama launch hermes --model ucx0204/glm-4.6V-Flash-Q8
ollama launch codex --model ucx0204/glm-4.6V-Flash-Q8
ollama launch opencode --model ucx0204/glm-4.6V-Flash-Q8
This is a GGUF version of the GLM-4.6V-Flash model, quantized to Q8_0 (8-bit) for high-quality inference. It originates from Zhipu AI and was converted/quantized by Unsloth.