Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
gemma · Ollama
Search for models on Ollama.
  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools audio cloud e2b e4b 26b 31b

    281.7K  Pulls 16  Tags Updated  yesterday

  • gemma3n

    Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones.

    e2b e4b

    1.5M  Pulls 9  Tags Updated  9 months ago

  • gemma3

    The current, most capable model that runs on a single GPU.

    vision cloud 270m 1b 4b 12b 27b

    34.9M  Pulls 29  Tags Updated  3 months ago

  • gemma2

    Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.

    2b 9b 27b

    19.7M  Pulls 94  Tags Updated  1 year ago

  • gemma

    Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1

    2b 7b

    6.6M  Pulls 102  Tags Updated  1 year ago

  • translategemma

    A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.

    vision 4b 12b 27b

    928.7K  Pulls 13  Tags Updated  2 months ago

  • functiongemma

    FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

    tools 270m

    129.6K  Pulls 4  Tags Updated  3 months ago

  • embeddinggemma

    EmbeddingGemma is a 300M parameter embedding model from Google.

    embedding 300m

    897.5K  Pulls 5  Tags Updated  6 months ago

  • codegemma

    CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

    2b 7b

    2.7M  Pulls 85  Tags Updated  1 year ago

  • shieldgemma

    ShieldGemma is set of instruction tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.

    2b 9b 27b

    705.1K  Pulls 49  Tags Updated  1 year ago

  • granite-embedding

    The IBM Granite Embedding 30M and 278M models models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.

    embedding 30m 278m

    290.8K  Pulls 6  Tags Updated  1 year ago

  • granite3.2-vision

    A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

    vision tools 2b

    861.1K  Pulls 5  Tags Updated  1 year ago

  • dolphin3

    Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.

    8b

    3.7M  Pulls 5  Tags Updated  1 year ago

  • mistral-large

    Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.

    tools 123b

    961.9K  Pulls 32  Tags Updated  1 year ago

  • glm4

    A strong multi-lingual general language model with competitive performance to Llama 3.

    9b

    882.8K  Pulls 32  Tags Updated  1 year ago

  • nous-hermes

    General use models based on Llama and Llama 2 from Nous Research.

    7b 13b

    892.6K  Pulls 63  Tags Updated  2 years ago

  • vicuna

    General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.

    7b 13b 33b

    872.4K  Pulls 111  Tags Updated  2 years ago

  • phind-codellama

    Code generation model based on Code Llama.

    34b

    745.5K  Pulls 49  Tags Updated  2 years ago

  • wizardlm

    General use model based on Llama 2.

    674.6K  Pulls 73  Tags Updated  2 years ago

  • codeup

    Great code generation model based on Llama2.

    13b

    466K  Pulls 19  Tags Updated  2 years ago

© 2026 Ollama
Blog Contact