Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.
281.7K Pulls 16 Tags Updated yesterday
Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones.
1.5M Pulls 9 Tags Updated 9 months ago
The current, most capable model that runs on a single GPU.
34.9M Pulls 29 Tags Updated 3 months ago
Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
19.7M Pulls 94 Tags Updated 1 year ago
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1.
6.6M Pulls 102 Tags Updated 1 year ago
A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.
928.7K Pulls 13 Tags Updated 2 months ago
FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.
129.6K Pulls 4 Tags Updated 3 months ago
EmbeddingGemma is a 300M parameter embedding model from Google.
897.5K Pulls 5 Tags Updated 6 months ago
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
2.7M Pulls 85 Tags Updated 1 year ago
ShieldGemma is a set of instruction-tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.
705.1K Pulls 49 Tags Updated 1 year ago
The IBM Granite Embedding 30M and 278M models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.
290.8K Pulls 6 Tags Updated 1 year ago
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
861.1K Pulls 5 Tags Updated 1 year ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.7M Pulls 5 Tags Updated 1 year ago
Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning, with a 128k context window and support for dozens of languages.
961.9K Pulls 32 Tags Updated 1 year ago
A strong multilingual general language model with performance competitive with Llama 3.
882.8K Pulls 32 Tags Updated 1 year ago
General use models based on Llama and Llama 2 from Nous Research.
892.6K Pulls 63 Tags Updated 2 years ago
General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.
872.4K Pulls 111 Tags Updated 2 years ago
Code generation model based on Code Llama.
745.5K Pulls 49 Tags Updated 2 years ago
General use model based on Llama 2.
674.6K Pulls 73 Tags Updated 2 years ago
Great code generation model based on Llama 2.
466K Pulls 19 Tags Updated 2 years ago