-
llama3
Meta Llama 3: The most capable openly available LLM to date
586.8K Pulls 67 Tags Updated 10 days ago
-
phi3
Phi-3 Mini is a lightweight, state-of-the-art 3.8B-parameter open model by Microsoft.
65.9K Pulls 6 Tags Updated 9 days ago
-
wizardlm2
State-of-the-art large language model from Microsoft AI with improved performance on complex chat, multilingual, reasoning, and agent use cases.
35.4K Pulls 22 Tags Updated 2 weeks ago
-
mistral
The 7B model released by Mistral AI, updated to version 0.2.
687.8K Pulls 68 Tags Updated 5 weeks ago
-
gemma
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind. Updated to version 1.1.
1.1M Pulls 102 Tags Updated 3 weeks ago
-
mixtral
A set of Mixture of Experts (MoE) models with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.
207.3K Pulls 58 Tags Updated 16 minutes ago
-
llama2
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
1.4M Pulls 102 Tags Updated 2 months ago
-
codegemma
CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.
36.3K Pulls 53 Tags Updated 2 weeks ago
-
command-r
Command R is a Large Language Model optimized for conversational interaction and long context tasks.
28.9K Pulls 17 Tags Updated 5 weeks ago
-
command-r-plus
Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.
24.2K Pulls 6 Tags Updated 2 weeks ago
-
llava
🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
169.1K Pulls 98 Tags Updated 3 months ago
-
dbrx
DBRX is an open, general-purpose LLM created by Databricks.
4,158 Pulls 7 Tags Updated 2 weeks ago
-
llama3-gradient
This model extends Llama 3 8B's context length from 8K to over 1M tokens.
4,636 Pulls 19 Tags Updated 2 days ago
-
moondream
moondream is a small vision language model designed to run efficiently on edge devices.
2,006 Pulls 18 Tags Updated 4 days ago
-
dolphin-llama3
Dolphin 2.9 is a new model by Eric Hartford, based on Llama 3 and available in 8B and 70B sizes, with a variety of instruction, conversational, and coding skills.
21.5K Pulls 54 Tags Updated 3 days ago
-
codeqwen
CodeQwen1.5 is a large language model pretrained on a large amount of code data.
9,909 Pulls 21 Tags Updated 2 weeks ago
-
snowflake-arctic-embed
A suite of text embedding models by Snowflake, optimized for performance.
3,763 Pulls 16 Tags Updated 2 weeks ago
-
mxbai-embed-large
State-of-the-art large embedding model from mixedbread.ai.
21.9K Pulls 3 Tags Updated 5 weeks ago
-
dolphincoder
An uncensored variant of the Dolphin model family, available in 7B and 15B sizes, that excels at coding and is based on StarCoder2.
18.3K Pulls 35 Tags Updated 3 weeks ago
-
starcoder2
StarCoder2 is the next generation of transparently trained open code LLMs, available in three sizes: 3B, 7B, and 15B parameters.
39.1K Pulls 67 Tags Updated yesterday
-
all-minilm
Embedding models trained on very large sentence-level datasets.
12K Pulls 8 Tags Updated 2 months ago
-
nomic-embed-text
A high-performing open embedding model with a large token context window.
85.2K Pulls 3 Tags Updated 2 months ago
-
stablelm2
Stable LM 2 is a state-of-the-art 1.6B parameter language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
13.2K Pulls 51 Tags Updated 3 weeks ago
-
duckdb-nsql
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
4,604 Pulls 17 Tags Updated 3 months ago
-
qwen
Qwen 1.5 is a series of large language models by Alibaba Cloud ranging from 0.5B to 110B parameters.
249.3K Pulls 379 Tags Updated 6 days ago