633 Pulls 5 Tags Updated 4 months ago
547 Pulls 1 Tag Updated 5 months ago
508 Pulls 1 Tag Updated 6 months ago
3 Pulls 1 Tag Updated 1 month ago
GLM 4.6V Flash 9B model with vision, tools, and hybrid thinking enabled. Uses a custom template to align it with Ollama and applies the recommended sampling settings by default. Uses Unsloth quants at Q4_K_M.
3,798 Pulls 1 Tag Updated 5 months ago
An uncensored model based on GLM-4.6v-flash:9b at q5_k_m. For local use, I recommend editing the context length in the Modelfile, as it is set to 128k. EDIT: A new locally optimised version of the same model, with a context of 4096, is available at https://ollama.com/ShreyanGondaliya/s5-reduced
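The context-length advice above can be sketched as a small Python helper that writes a Modelfile overriding the context window. This is only an illustration: the helper name `build_modelfile` and the base tag `example/glm-4.6v-flash-uncensored` are hypothetical placeholders, while `FROM` and `PARAMETER num_ctx` are standard Modelfile instructions.

```python
def build_modelfile(base_model: str, num_ctx: int = 4096) -> str:
    """Return Modelfile text deriving from base_model with a custom context size."""
    if num_ctx <= 0:
        raise ValueError("num_ctx must be a positive integer")
    # FROM selects the base model; PARAMETER num_ctx sets the context window.
    return f"FROM {base_model}\nPARAMETER num_ctx {num_ctx}\n"


if __name__ == "__main__":
    # Hypothetical base tag; substitute the model actually pulled locally.
    print(build_modelfile("example/glm-4.6v-flash-uncensored", 4096), end="")
```

Saving the output as `Modelfile` and running `ollama create my-model -f Modelfile` (with `my-model` as a name of your choosing) should then build a local variant with the smaller context.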
415 Pulls 1 Tag Updated 4 months ago
GLM-4.6V-Flash (9B) is a lightweight model optimized for local deployment and low-latency applications. It scales its context window to 128k tokens in training and achieves SoTA performance in visual understanding among models of similar parameter scales.
366 Pulls 3 Tags Updated 4 months ago
unsloth/GLM-4.6V 106b
151 Pulls 1 Tag Updated 5 months ago
Abliterated (Uncensored) GLM4.6 Flash
7,155 Pulls 11 Tags Updated 6 months ago
GLM-4.6 is a hybrid reasoning model that provides two modes: a thinking mode for complex reasoning and tool use, and a non-thinking mode for immediate responses.
5,701 Pulls 9 Tags Updated 6 months ago
1,114 Pulls 1 Tag Updated 6 months ago
New version GLM-4.6
430 Pulls 1 Tag Updated 8 months ago
Model imported from Hugging Face.
239 Pulls 1 Tag Updated 6 months ago
This model is a mixed GGUF q2ks quantization of Cerebras' GLM-4.6-REAP-218B-A32B-FP8, generated using Intel's AutoRound algorithm.
159 Pulls 1 Tag Updated 7 months ago
GLM-4.6-REAP-268B-A32B (by Cerebras), a memory-efficient compressed variant of GLM-4.6 that maintains near-identical performance while being 25% lighter.
131 Pulls 9 Tags Updated 6 months ago
This model requires Ollama v0.6.6 or later
5,816 Pulls 1 Tag Updated 1 year ago
GLM-4-0414 32B with 128k context (YaRN RoPE scaling). Requires Ollama 0.6.6.
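The version requirement above can be checked before pulling. The sketch below assumes you already have a version string, for example the one printed by `ollama --version`; the helper names `parse_version` and `meets_minimum` are hypothetical, and the parsing simply takes the first `x.y.z` pattern it finds.

```python
import re

MIN_VERSION = (0, 6, 6)  # minimum version stated in the listing


def parse_version(text: str) -> tuple:
    """Extract the first x.y.z version number found in text."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.groups())


def meets_minimum(text: str, minimum: tuple = MIN_VERSION) -> bool:
    """Return True if the version in text is at least the minimum."""
    # Tuples compare element by element, so (0, 10, 0) > (0, 6, 6).
    return parse_version(text) >= minimum


if __name__ == "__main__":
    print(meets_minimum("0.6.5"))
    print(meets_minimum("0.6.6"))
```

Comparing integer tuples rather than raw strings avoids the common mistake of treating `0.10.0` as older than `0.6.6`.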
1,077 Pulls 1 Tag Updated 1 year ago
gpt-oss_claude-sonnet4.6 is the GPT-OSS model running on the Claude Sonnet 4.6 system prompt, combining GPT-OSS's open-source foundation with Claude Sonnet 4.6's advanced instructions and behavior.
7,576 Pulls 1 Tag Updated 3 months ago
gemma3_claude-sonnet4.6 is the Gemma 3 model running on the Claude Sonnet 4.6 system prompt, combining Gemma 3's open-source foundation with Claude Sonnet 4.6's instructions and behavior.
915 Pulls 1 Tag Updated 3 months ago
Gemma 4 distilled from Claude Opus 4.6 Thinking. Has only a 5% gap with Claude Opus 4.6 Thinking while being over 40x smaller. Designed for both server and local inference.
560 Pulls 1 Tag Updated 2 months ago