Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
glm-4.5 · Ollama
Search for models on Ollama.
  • MichelRosselli/GLM-4.5-Air

    GLM-4.5-Air is a hybrid reasoning model that provides two modes: a thinking mode for complex reasoning and tool use, and a non-thinking mode for immediate responses.

    tools thinking

    41.2K  Pulls 9  Tags Updated  9 months ago

  • gurubot/GLM-4.5-Air-Derestricted

    uncensored GLM Air

    tools thinking

    1,518  Pulls 1  Tag Updated  6 months ago

  • aiasistentworld/GLM-4.5-Air-LLM

    The Conscious Partner Your Dynamic AI Collaborator

    341  Pulls 1  Tag Updated  9 months ago

  • dset/GLM-4.5

    tools thinking

    168  Pulls 1  Tag Updated  9 months ago

  • SimonPu/GLM-4.7-Flash

    This model was base on unsloth/GLM-4.7-Flash and trained on a small reasoning dataset of Claude Opus 4.5, with reasoning effort set to High.

    tools thinking

    1,066  Pulls 4  Tags Updated  4 months ago

  • ShreyanGondaliya/s5

    A model based on the GLM-4.6v-flash:9b q5_k_m, and uncensored. For local use I recommend editing the model context in modelfile as it is set to 128k. #EDIT: New local optimised model same with context 4096 https://ollama.com/ShreyanGondaliya/s5-reduced

    414  Pulls 1  Tag Updated  4 months ago

  • bjoernb/gemma4-e4b-fast

    Gemma 4 E4B (Google DeepMind) with thinking mode disabled. Compact multimodal model — 4.5B effective / 8B total parameters. Supports text, image and audio input. Designed for edge devices and local deployment. Knowledge cutoff: January 2025.

    vision tools thinking audio

    1,352  Pulls 1  Tag Updated  2 months ago

  • bjoernb/gemma4-e4b-think

    Gemma 4 E4B (Google DeepMind) with thinking mode enabled. Compact multimodal model — 4.5B effective / 8B total parameters. Supports text, image and audio input. Designed for edge devices and local deployment. Knowledge cutoff: January 2025.

    vision tools thinking audio

    737  Pulls 1  Tag Updated  2 months ago

  • ShreyanGondaliya/gemma-4-claude-opus-4.6-thinking-s7-multimodal

    Gemma 4 distilled from claude opus 4.6 thinking. Has only a 5% gap with claude opus 4.6 thinking while being over 40x smaller. Designed for server inference. Designed for local inference

    tools thinking

    558  Pulls 1  Tag Updated  2 months ago

  • jewelzufo/unsloth_granite-4.0-h-350m-GGUF

    Granite-4.0-H-350M is a lightweight instruct model finetuned from Granite-4.0-H-350M-Base using a combination of open-source instruction datasets with permissive license and internally collected synthetic datasets.

    tools

    426  Pulls 16  Tags Updated  6 months ago

  • mirage335/gpt-oss-20b-virtuoso

    Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

    tools thinking

    100  Pulls 1  Tag Updated  6 months ago

  • mirage335/gpt-oss-120b-virtuoso

    Generic all purpose model. Occasionally may have notable logic, usually Llama-3_3-Nemotron-Super-49B-v1_5 is preferred.

    tools thinking

    85  Pulls 1  Tag Updated  6 months ago

  • type32/lemonade-rp

    A PORT TO OLLAMA FROM THE ORIGINAL: https://huggingface.co/KatyTheCutie/LemonadeRP-4.5.3

    281  Pulls 2  Tags Updated  1 year ago

© 2026 Ollama
Blog Contact