Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
gemma · Ollama
Search for models on Ollama.
  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools thinking audio cloud e2b e4b 26b 31b

    9.4M  Pulls 30  Tags Updated  2 weeks ago

  • gemma3n

    Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones.

    e2b e4b

    1.6M  Pulls 9  Tags Updated  10 months ago

  • gemma3

    The current, most capable model that runs on a single GPU.

    vision cloud 270m 1b 4b 12b 27b

    36.8M  Pulls 29  Tags Updated  5 months ago

  • gemma2

    Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.

    2b 9b 27b

    23.5M  Pulls 94  Tags Updated  1 year ago

  • gemma

    Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1

    2b 7b

    7.1M  Pulls 102  Tags Updated  2 years ago

  • translategemma

    A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.

    vision 4b 12b 27b

    1.5M  Pulls 13  Tags Updated  4 months ago

  • medgemma

    MedGemma is a collection of Gemma 3 variants that are trained for performance on medical text and image comprehension.

    vision 4b 27b

    31.5K  Pulls 9  Tags Updated  1 month ago

  • functiongemma

    FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

    tools 270m

    158.8K  Pulls 4  Tags Updated  5 months ago

  • medgemma1.5

    MedGemma 1.5 4B is an updated version of the MedGemma 4B model.

    vision 4b

    17.4K  Pulls 5  Tags Updated  1 month ago

  • embeddinggemma

    EmbeddingGemma is a 300M parameter embedding model from Google.

    embedding 300m

    1.2M  Pulls 5  Tags Updated  8 months ago

  • codegemma

    CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

    2b 7b

    3M  Pulls 85  Tags Updated  1 year ago

  • shieldgemma

    ShieldGemma is set of instruction tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.

    2b 9b 27b

    872.2K  Pulls 49  Tags Updated  1 year ago

  • granite-embedding

    The IBM Granite Embedding 30M and 278M models models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.

    embedding 30m 278m

    327.5K  Pulls 6  Tags Updated  1 year ago

  • granite3.2-vision

    A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

    vision tools 2b

    912.1K  Pulls 5  Tags Updated  1 year ago

  • dolphin3

    Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.

    8b

    3.8M  Pulls 5  Tags Updated  1 year ago

  • mistral-large

    Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.

    tools 123b

    1.2M  Pulls 32  Tags Updated  1 year ago

  • glm4

    A strong multi-lingual general language model with competitive performance to Llama 3.

    9b

    1.1M  Pulls 32  Tags Updated  1 year ago

  • nous-hermes

    General use models based on Llama and Llama 2 from Nous Research.

    7b 13b

    1.1M  Pulls 63  Tags Updated  2 years ago

  • vicuna

    General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.

    7b 13b 33b

    1.1M  Pulls 111  Tags Updated  2 years ago

  • phind-codellama

    Code generation model based on Code Llama.

    34b

    915.3K  Pulls 49  Tags Updated  2 years ago

© 2026 Ollama
Blog Contact