The 7B model released by Mistral AI, updated to version 0.3.
24.8M Pulls 84 Tags Updated 6 months ago
OLMo 2 is a new family of 7B and 13B models trained on up to 5T tokens. These models are on par with or better than equivalently sized fully open models, and competitive with open-weight models such as Llama 3.1 on English academic benchmarks.
3.5M Pulls 9 Tags Updated 1 year ago
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
5.3M Pulls 102 Tags Updated 2 years ago
StarCoder2 is the next generation of transparently trained open code LLMs that comes in three sizes: 3B, 7B and 15B parameters.
2M Pulls 67 Tags Updated 1 year ago
BakLLaVA is a multimodal model consisting of the Mistral 7B base model augmented with the LLaVA architecture.
475.4K Pulls 17 Tags Updated 2 years ago
InternLM2.5 is a 7B parameter model tailored for practical scenarios with outstanding reasoning capability.
303.6K Pulls 65 Tags Updated 1 year ago
OpenHermes 2.5 is a 7B model fine-tuned by Teknium on Mistral with fully open datasets.
385K Pulls 35 Tags Updated 2 years ago
Wizard Vicuna Uncensored is a 7B, 13B, and 30B parameter model based on Llama 2 uncensored by Eric Hartford.
468K Pulls 49 Tags Updated 2 years ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
354.3K Pulls 36 Tags Updated 1 year ago
A 7B and 15B uncensored variant of the Dolphin model family that excels at coding, based on StarCoder2.
314.1K Pulls 35 Tags Updated 1 year ago
Mistral OpenOrca is a 7 billion parameter model, fine-tuned on top of the Mistral 7B model using the OpenOrca dataset.
309.7K Pulls 17 Tags Updated 2 years ago
MathΣtral: a 7B model designed for math reasoning and scientific discovery by Mistral AI.
179.7K Pulls 17 Tags Updated 1 year ago
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
168.4K Pulls 17 Tags Updated 2 years ago
🎩 Magicoder is a family of 7B parameter models trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
171.6K Pulls 18 Tags Updated 2 years ago
A 7B chat model fine-tuned with high-quality data and based on Zephyr.
160K Pulls 18 Tags Updated 2 years ago
Q4_L_M version of PsychoCounsel-Llama3-8B
19 Pulls 1 Tag Updated 1 month ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
109.6M Pulls 93 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
3.2M Pulls 14 Tags Updated 1 year ago
Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.
2.3M Pulls 21 Tags Updated 1 year ago
A general-purpose model ranging from 3 billion parameters to 70 billion, suitable for entry-level hardware.
1.9M Pulls 119 Tags Updated 2 years ago