Models
-
gemma
Gemma is a family of lightweight, state-of-the-art open models built by Google DeepMind.
110.7K Pulls 69 Tags Updated 10 days ago
-
llama2
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
567.5K Pulls 102 Tags Updated 3 weeks ago
-
mistral
The 7B model released by Mistral AI, updated to version 0.2.
276.3K Pulls 53 Tags Updated 2 months ago
-
mixtral
A high-quality Mixture of Experts (MoE) model with open weights by Mistral AI.
83.1K Pulls 34 Tags Updated 4 weeks ago
-
llava
🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.
62.8K Pulls 98 Tags Updated 4 weeks ago
-
neural-chat
A fine-tuned model based on Mistral with good coverage of domain and language.
13.9K Pulls 50 Tags Updated 2 months ago
-
codellama
A large language model that can use text prompts to generate and discuss code.
198.8K Pulls 199 Tags Updated 4 weeks ago
-
dolphin-mixtral
An uncensored, fine-tuned model based on the Mixtral mixture of experts model that excels at coding tasks. Created by Eric Hartford.
132.2K Pulls 70 Tags Updated 2 months ago
-
mistral-openorca
Mistral OpenOrca is a 7 billion parameter model, fine-tuned on top of the Mistral 7B model using the OpenOrca dataset.
100.5K Pulls 17 Tags Updated 4 months ago
-
llama2-uncensored
Uncensored Llama 2 model by George Sung and Jarrad Hope.
80.6K Pulls 34 Tags Updated 4 months ago
-
phi
Phi-2: a 2.7B language model by Microsoft Research that demonstrates outstanding reasoning and language understanding capabilities.
54.6K Pulls 18 Tags Updated 4 weeks ago
-
orca-mini
A general-purpose model ranging from 3 billion parameters to 70 billion, suitable for entry-level hardware.
52.2K Pulls 119 Tags Updated 4 months ago
-
deepseek-coder
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
49.0K Pulls 102 Tags Updated 2 months ago
-
dolphin-mistral
The uncensored Dolphin model based on Mistral that excels at coding tasks. Updated to version 2.6.
43.5K Pulls 103 Tags Updated 7 weeks ago
-
wizard-vicuna-uncensored
Wizard Vicuna Uncensored is a family of 7B, 13B, and 30B parameter models based on Llama 2, uncensored by Eric Hartford.
30.5K Pulls 49 Tags Updated 4 months ago
-
vicuna
General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.
30.0K Pulls 111 Tags Updated 4 months ago
-
qwen
Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 72B parameters.
25.3K Pulls 319 Tags Updated 3 weeks ago
-
zephyr
Zephyr beta is a fine-tuned 7B version of Mistral that was trained on a mix of publicly available, synthetic datasets.
22.5K Pulls 34 Tags Updated 2 months ago
-
openhermes
OpenHermes 2.5 is a 7B model fine-tuned by Teknium on Mistral with fully open datasets.
21.8K Pulls 35 Tags Updated 2 months ago
-
llama2-chinese
Llama 2-based model fine-tuned to improve Chinese dialogue ability.
19.0K Pulls 35 Tags Updated 4 months ago
-
wizardcoder
State-of-the-art code generation model.
18.8K Pulls 67 Tags Updated 8 weeks ago
-
tinyllama
The TinyLlama project is an open endeavor to train a compact 1.1B Llama model on 3 trillion tokens.
17.6K Pulls 36 Tags Updated 2 months ago
-
openchat
A family of open-source models trained on a wide variety of data, surpassing ChatGPT on various benchmarks. Updated to version 3.5-0106.
16.4K Pulls 50 Tags Updated 7 weeks ago
-
phind-codellama
Code generation model based on Code Llama.
16.4K Pulls 49 Tags Updated 2 months ago
-
orca2
Orca 2 is built by Microsoft Research and is a fine-tuned version of Meta's Llama 2 models. The model is designed to excel particularly in reasoning.
13.7K Pulls 33 Tags Updated 3 months ago
-
falcon
A large language model built by the Technology Innovation Institute (TII) for use in summarization, text generation, and chatbots.
13.1K Pulls 38 Tags Updated 4 months ago
-
tinydolphin
An experimental 1.1B parameter model trained on the new Dolphin 2.8 dataset by Eric Hartford and based on TinyLlama.
13.0K Pulls 18 Tags Updated 4 weeks ago
-
wizard-math
Model focused on math and logic problems.
12.4K Pulls 64 Tags Updated 2 months ago
-
nous-hermes
General use models based on Llama and Llama 2 from Nous Research.
11.7K Pulls 63 Tags Updated 4 months ago
-
yi
A high-performing, bilingual language model.
11.6K Pulls 78 Tags Updated 2 months ago
-
dolphin-phi
2.7B uncensored Dolphin model by Eric Hartford, based on the Phi language model by Microsoft Research.
11.3K Pulls 15 Tags Updated 2 months ago
-
starcoder
StarCoder is a code generation model trained on 80+ programming languages.
10.1K Pulls 100 Tags Updated 4 months ago
-
starling-lm
Starling is a large language model trained by reinforcement learning from AI feedback focused on improving chatbot helpfulness.
9,998 Pulls 18 Tags Updated 3 months ago
-
codeup
Great code generation model based on Llama 2.
9,009 Pulls 19 Tags Updated 4 months ago
-
stable-code
Stable Code 3B is a model offering accurate and responsive code completion on par with models such as Code Llama 7B that are 2.5x larger.
8,749 Pulls 18 Tags Updated 6 weeks ago
-
medllama2
Fine-tuned Llama 2 model to answer medical questions based on an open source medical dataset.
8,732 Pulls 17 Tags Updated 4 months ago
-
bakllava
BakLLaVA is a multimodal model consisting of the Mistral 7B base model augmented with the LLaVA architecture.
8,294 Pulls 17 Tags Updated 2 months ago
-
everythinglm
Uncensored Llama 2-based model with support for a 16K context window.
8,138 Pulls 18 Tags Updated 2 months ago
-
wizardlm-uncensored
Uncensored version of the Wizard LM model.
8,136 Pulls 18 Tags Updated 4 months ago
-
solar
A compact yet powerful 10.7B large language model designed for single-turn conversation.
7,769 Pulls 32 Tags Updated 2 months ago
-
stable-beluga
Llama 2-based model fine-tuned on an Orca-style dataset. Originally called Free Willy.
7,088 Pulls 49 Tags Updated 4 months ago
-
sqlcoder
SQLCoder is a code completion model fine-tuned on StarCoder for SQL generation tasks.
6,477 Pulls 48 Tags Updated 4 weeks ago
-
nous-hermes2-mixtral
The Nous Hermes 2 model from Nous Research, now trained over Mixtral.
6,121 Pulls 18 Tags Updated 6 weeks ago
-
yarn-mistral
An extension of Mistral to support context windows of 64K or 128K.
5,976 Pulls 33 Tags Updated 2 months ago
-
samantha-mistral
A companion assistant trained in philosophy, psychology, and personal relationships. Based on Mistral.
5,504 Pulls 49 Tags Updated 4 months ago
-
meditron
Open-source medical large language model adapted from Llama 2 to the medical domain.
5,293 Pulls 22 Tags Updated 3 months ago
-
stablelm-zephyr
A lightweight chat model allowing accurate and responsive output without requiring high-end hardware.
5,229 Pulls 17 Tags Updated 2 months ago
-
stablelm2
Stable LM 2 1.6B is a state-of-the-art 1.6 billion parameter small language model trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
5,122 Pulls 34 Tags Updated 5 weeks ago
-
magicoder
🎩 Magicoder is a family of 7B parameter models trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
4,921 Pulls 18 Tags Updated 3 months ago
-
wizard-vicuna
Wizard Vicuna is a 13B parameter model based on Llama 2 trained by MelodysDreamj.
4,878 Pulls 17 Tags Updated 4 months ago
-
yarn-llama2
An extension of Llama 2 that supports a context of up to 128k tokens.
4,701 Pulls 67 Tags Updated 4 months ago
-
nous-hermes2
The powerful family of models by Nous Research that excels at scientific discussion and coding tasks.
4,639 Pulls 33 Tags Updated 2 months ago
-
deepseek-llm
An advanced language model crafted with 2 trillion bilingual tokens.
4,379 Pulls 64 Tags Updated 2 months ago
-
llama-pro
An expansion of Llama 2 that specializes in integrating both general language understanding and domain-specific knowledge, particularly in programming and mathematics.
4,145 Pulls 33 Tags Updated 7 weeks ago
-
nomic-embed-text
A high-performing open embedding model with a large token context window.
4,135 Pulls 3 Tags Updated 3 days ago
-
codebooga
A high-performing code instruct model created by merging two existing code models.
3,807 Pulls 16 Tags Updated 4 months ago
-
open-orca-platypus2
Merge of the Open Orca OpenChat model and the Garage-bAInd Platypus 2 model. Designed for chat and code generation.
3,741 Pulls 17 Tags Updated 4 months ago
-
mistrallite
MistralLite is a fine-tuned model based on Mistral with enhanced capabilities of processing long contexts.
3,716 Pulls 17 Tags Updated 4 months ago
-
nexusraven
Nexus Raven is a 13B instruction tuned model for function calling tasks.
3,618 Pulls 32 Tags Updated 6 weeks ago
-
starcoder2
StarCoder2 is the next generation of transparently trained open code LLMs that come in three sizes: 3B, 7B, and 15B parameters. Requires Ollama 0.1.28 in pre-release.
2,997 Pulls 49 Tags Updated yesterday
-
goliath
A language model created by combining two fine-tuned Llama 2 70B models into one.
2,807 Pulls 16 Tags Updated 3 months ago
-
notux
A top-performing mixture of experts model, fine-tuned with high-quality data.
2,664 Pulls 18 Tags Updated 2 months ago
-
alfred
A robust conversational model designed to be used for both chat and instruct use cases.
2,223 Pulls 7 Tags Updated 3 months ago
-
megadolphin
MegaDolphin-2.2-120b is a transformation of Dolphin-2.2-70b created by interleaving the model with itself.
2,184 Pulls 19 Tags Updated 7 weeks ago
-
xwinlm
Conversational model based on Llama 2 that performs competitively on various benchmarks.
2,009 Pulls 80 Tags Updated 4 months ago
-
wizardlm
General use 70 billion parameter model based on Llama 2.
1,993 Pulls 73 Tags Updated 4 months ago
-
notus
A 7B chat model fine-tuned with high-quality data and based on Zephyr.
1,670 Pulls 18 Tags Updated 2 months ago
-
duckdb-nsql
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
1,526 Pulls 17 Tags Updated 5 weeks ago
-
all-minilm
Embedding models trained on very large sentence-level datasets.
1,187 Pulls 8 Tags Updated 11 days ago