633 Pulls 5 Tags Updated 4 months ago
547 Pulls 1 Tag Updated 5 months ago
508 Pulls 1 Tag Updated 6 months ago
GLM 4.6V Flash 9B model with vision, tools, and hybrid thinking enabled. using custom template to align it to ollama and the recomended sampling settigns by default. using unsloth quants at q4K_M
3,798 Pulls 1 Tag Updated 5 months ago
A model based on the GLM-4.6v-flash:9b q5_k_m, and uncensored. For local use I recommend editing the model context in modelfile as it is set to 128k. #EDIT: New local optimised model same with context 4096 https://ollama.com/ShreyanGondaliya/s5-reduced
415 Pulls 1 Tag Updated 4 months ago
GLM-4.6V-Flash (9B) is a lightweight model optimized for local deployment and low-latency applications. It scales its context window to 128k tokens in training and achieves SoTA performance in visual understanding among models of similar parameter scales.
366 Pulls 3 Tags Updated 4 months ago
unsloth/GLM-4.6V 106b
151 Pulls 1 Tag Updated 5 months ago
Abliterated (Uncensored) GLM4.6 Flash
7,155 Pulls 11 Tags Updated 6 months ago
1,114 Pulls 1 Tag Updated 6 months ago
model imported from hf
239 Pulls 1 Tag Updated 6 months ago
New version GLM-4.6
430 Pulls 1 Tag Updated 8 months ago
GLM-4.6-REAP-268B-A32B (by Cerebras), a memory-efficient compressed variant of GLM-4.6 that maintains near-identical performance while being 25% lighter.
131 Pulls 9 Tags Updated 6 months ago
GLM-4.6 is a hybrid reasoning model that provides two modes: a thinking mode for complex reasoning and tool use, and a non-thinking mode for immediate responses.
5,701 Pulls 9 Tags Updated 6 months ago