glm-4.5 · Ollama

MichelRosselli/GLM-4.5-Air

GLM-4.5-Air is a hybrid reasoning model that provides two modes: a thinking mode for complex reasoning and tool use, and a non-thinking mode for immediate responses.

tools thinking

41.2K Pulls 9 Tags Updated 9 months ago

gurubot/GLM-4.5-Air-Derestricted

uncensored GLM Air

tools thinking

1,518 Pulls 1 Tag Updated 6 months ago

aiasistentworld/GLM-4.5-Air-LLM

The Conscious Partner Your Dynamic AI Collaborator

341 Pulls 1 Tag Updated 9 months ago

dset/GLM-4.5

tools thinking

168 Pulls 1 Tag Updated 9 months ago

SimonPu/GLM-4.7-Flash

This model was base on unsloth/GLM-4.7-Flash and trained on a small reasoning dataset of Claude Opus 4.5, with reasoning effort set to High.

tools thinking

1,066 Pulls 4 Tags Updated 4 months ago

ShreyanGondaliya/s5

A model based on the GLM-4.6v-flash:9b q5_k_m, and uncensored. For local use I recommend editing the model context in modelfile as it is set to 128k. #EDIT: New local optimised model same with context 4096 https://ollama.com/ShreyanGondaliya/s5-reduced

414 Pulls 1 Tag Updated 4 months ago

bjoernb/gemma4-e4b-fast

Gemma 4 E4B (Google DeepMind) with thinking mode disabled. Compact multimodal model — 4.5B effective / 8B total parameters. Supports text, image and audio input. Designed for edge devices and local deployment. Knowledge cutoff: January 2025.

vision tools thinking audio

1,352 Pulls 1 Tag Updated 2 months ago

bjoernb/gemma4-e4b-think

Gemma 4 E4B (Google DeepMind) with thinking mode enabled. Compact multimodal model — 4.5B effective / 8B total parameters. Supports text, image and audio input. Designed for edge devices and local deployment. Knowledge cutoff: January 2025.

vision tools thinking audio

737 Pulls 1 Tag Updated 2 months ago

ShreyanGondaliya/gemma-4-claude-opus-4.6-thinking-s7-multimodal

Gemma 4 distilled from claude opus 4.6 thinking. Has only a 5% gap with claude opus 4.6 thinking while being over 40x smaller. Designed for server inference. Designed for local inference

tools thinking

558 Pulls 1 Tag Updated 2 months ago

jewelzufo/unsloth_granite-4.0-h-350m-GGUF

Granite-4.0-H-350M is a lightweight instruct model finetuned from Granite-4.0-H-350M-Base using a combination of open-source instruction datasets with permissive license and internally collected synthetic datasets.

tools

426 Pulls 16 Tags Updated 6 months ago

mirage335/gpt-oss-20b-virtuoso

Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

tools thinking

100 Pulls 1 Tag Updated 6 months ago

mirage335/gpt-oss-120b-virtuoso

Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

tools thinking

85 Pulls 1 Tag Updated 6 months ago

type32/lemonade-rp

A PORT TO OLLAMA FROM THE ORIGINAL: https://huggingface.co/KatyTheCutie/LemonadeRP-4.5.3

281 Pulls 2 Tags Updated 1 year ago