Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
glm · Ollama
Search for models on Ollama.
  • glm-4.7-flash

    As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

    tools thinking

    1M  Pulls 4  Tags Updated  2 months ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    155.3K  Pulls 1  Tag Updated  1 month ago

  • glm-ocr

    GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

    vision tools

    177.9K  Pulls 3  Tags Updated  1 month ago

  • glm-4.7

    Advancing the Coding Capability

    tools thinking cloud

    81.8K  Pulls 1  Tag Updated  3 months ago

  • glm-4.6

    Advanced agentic, reasoning and coding capabilities.

    tools thinking cloud

    98.8K  Pulls 1  Tag Updated  5 months ago

  • glm4

    A strong multi-lingual general language model with competitive performance to Llama 3.

    9b

    878.1K  Pulls 32  Tags Updated  1 year ago

  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools audio cloud e2b e4b 26b 31b

    187.8K  Pulls 16  Tags Updated  yesterday

  • granite3.3

    IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.

    tools 2b 8b

    970.8K  Pulls 3  Tags Updated  11 months ago

  • gemma

    Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1

    2b 7b

    6.6M  Pulls 102  Tags Updated  1 year ago

  • granite3.1-moe

    The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.

    tools 1b 3b

    2.7M  Pulls 33  Tags Updated  1 year ago

  • granite3.2-vision

    A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

    vision tools 2b

    860.2K  Pulls 5  Tags Updated  1 year ago

  • granite3.1-dense

    The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.

    tools 2b 8b

    776.7K  Pulls 33  Tags Updated  1 year ago

  • granite3.2

    Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.

    tools 2b 8b

    383.8K  Pulls 9  Tags Updated  1 year ago

  • goliath

    A language model created by combining two fine-tuned Llama 2 70B models into one.

    377.3K  Pulls 16  Tags Updated  2 years ago

  • qwen3

    Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

    tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

    25.7M  Pulls 58  Tags Updated  5 months ago

  • granite4

    Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.

    tools 350m 1b 3b

    1M  Pulls 17  Tags Updated  5 months ago

  • glennjammin/log-doctor

    Model to analyze log files and help troubleshoot errors based on ministral-3:3b model

    tools

    32  Pulls 1  Tag Updated  1 month ago

  • gemma3

    The current, most capable model that runs on a single GPU.

    vision cloud 270m 1b 4b 12b 27b

    34.9M  Pulls 29  Tags Updated  3 months ago

  • gemma2

    Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.

    2b 9b 27b

    19.7M  Pulls 94  Tags Updated  1 year ago

  • codegemma

    CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

    2b 7b

    2.7M  Pulls 85  Tags Updated  1 year ago

© 2026 Ollama
Blog Contact