gemma

gemma4

Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

vision tools thinking audio cloud e2b e4b 12b 26b 31b

19.5M Pulls 49 Tags Updated 3 weeks ago

gemma3

The current, most capable model that runs on a single GPU.

vision 270m 1b 4b 12b 27b

38.9M Pulls 26 Tags Updated 11 months ago

gemma2

Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.

2b 9b 27b

28.6M Pulls 94 Tags Updated 1 year ago

gemma

Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1.

2b 7b

7.3M Pulls 102 Tags Updated 2 years ago

gemma3n

Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones.

e2b e4b

1.9M Pulls 9 Tags Updated 1 year ago

medgemma

MedGemma is a collection of Gemma 3 variants that are trained for performance on medical text and image comprehension.

vision 4b 27b

203.1K Pulls 9 Tags Updated 3 months ago

medgemma1.5

MedGemma 1.5 4B is an updated version of the MedGemma 4B model.

vision 4b

95.7K Pulls 5 Tags Updated 3 months ago

translategemma

A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.

vision 4b 12b 27b

1.8M Pulls 13 Tags Updated 6 months ago

embeddinggemma

EmbeddingGemma is a 300M parameter embedding model from Google.

embedding 300m

1.5M Pulls 5 Tags Updated 10 months ago

functiongemma

FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

tools 270m

174.7K Pulls 4 Tags Updated 7 months ago

codegemma

CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

2b 7b

3.1M Pulls 85 Tags Updated 2 years ago

shieldgemma

ShieldGemma is a set of instruction-tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.

2b 9b 27b

900K Pulls 49 Tags Updated 1 year ago

granite-embedding

The IBM Granite Embedding 30M and 278M models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.

embedding 30m 278m

342.7K Pulls 6 Tags Updated 1 year ago

granite3.2-vision

A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

vision tools 2b

951K Pulls 5 Tags Updated 1 year ago

dolphin3

Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.

8b

3.9M Pulls 5 Tags Updated 1 year ago

mistral-large

Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning, with a 128k context window and support for dozens of languages.

tools 123b

1.2M Pulls 32 Tags Updated 1 year ago

glm4

A strong multilingual general language model with performance competitive with Llama 3.

9b

1.2M Pulls 32 Tags Updated 2 years ago

nous-hermes

General use models based on Llama and Llama 2 from Nous Research.

7b 13b

1.2M Pulls 63 Tags Updated 2 years ago

vicuna

General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.

7b 13b 33b

1.1M Pulls 111 Tags Updated 2 years ago

phind-codellama

Code generation model based on Code Llama.

34b

943.3K Pulls 49 Tags Updated 2 years ago