Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.
9.4M Pulls 30 Tags Updated 2 weeks ago
Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones.
1.6M Pulls 9 Tags Updated 10 months ago
The current, most capable model that runs on a single GPU.
36.8M Pulls 29 Tags Updated 5 months ago
Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
23.5M Pulls 94 Tags Updated 1 year ago
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1
7.1M Pulls 102 Tags Updated 2 years ago
A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.
1.5M Pulls 13 Tags Updated 4 months ago
MedGemma is a collection of Gemma 3 variants that are trained for performance on medical text and image comprehension.
31.5K Pulls 9 Tags Updated 1 month ago
FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.
158.8K Pulls 4 Tags Updated 5 months ago
MedGemma 1.5 4B is an updated version of the MedGemma 4B model.
17.4K Pulls 5 Tags Updated 1 month ago
EmbeddingGemma is a 300M parameter embedding model from Google.
1.2M Pulls 5 Tags Updated 8 months ago
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
3M Pulls 85 Tags Updated 1 year ago
ShieldGemma is set of instruction tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.
872.2K Pulls 49 Tags Updated 1 year ago
The IBM Granite Embedding 30M and 278M models models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.
327.5K Pulls 6 Tags Updated 1 year ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
912.1K Pulls 5 Tags Updated 1 year ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.8M Pulls 5 Tags Updated 1 year ago
Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.
1.2M Pulls 32 Tags Updated 1 year ago
A strong multi-lingual general language model with competitive performance to Llama 3.
1.1M Pulls 32 Tags Updated 1 year ago
General use models based on Llama and Llama 2 from Nous Research.
1.1M Pulls 63 Tags Updated 2 years ago
General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.
1.1M Pulls 111 Tags Updated 2 years ago
Code generation model based on Code Llama.
915.3K Pulls 49 Tags Updated 2 years ago