The 7B model released by Mistral AI, updated to version 0.3.
19.9M Pulls 84 Tags Updated 2 months ago
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
4.1M Pulls 102 Tags Updated 1 year ago
OLMo 2 is a new family of 7B and 13B models trained on up to 5T tokens. These models are on par with or better than equivalently sized fully open models, and competitive with open-weight models such as Llama 3.1 on English academic benchmarks.
2.8M Pulls 9 Tags Updated 8 months ago
StarCoder2 is the next generation of transparently trained open code LLMs, available in three sizes: 3B, 7B, and 15B parameters.
1.3M Pulls 67 Tags Updated 1 year ago
Wizard Vicuna Uncensored is a family of 7B, 13B, and 30B parameter models based on Llama 2 and uncensored by Eric Hartford.
246K Pulls 49 Tags Updated 1 year ago
Mistral OpenOrca is a 7 billion parameter model, fine-tuned on top of the Mistral 7B model using the OpenOrca dataset.
187.2K Pulls 17 Tags Updated 1 year ago
OpenHermes 2.5 is a 7B model fine-tuned by Teknium on Mistral with fully open datasets.
184.3K Pulls 35 Tags Updated 1 year ago
Stable Code 3B is a coding model with instruct and code completion variants, on par with models such as Code Llama 7B that are 2.5x larger.
153.9K Pulls 36 Tags Updated 1 year ago
BakLLaVA is a multimodal model consisting of the Mistral 7B base model augmented with the LLaVA architecture.
140K Pulls 17 Tags Updated 1 year ago
Uncensored 7B and 15B variants of the Dolphin model family, based on StarCoder2, that excel at coding.
109K Pulls 35 Tags Updated 1 year ago
InternLM2.5 is a 7B parameter model tailored for practical scenarios with outstanding reasoning capability.
99.7K Pulls 65 Tags Updated 1 year ago
MathΣtral: a 7B model designed for math reasoning and scientific discovery by Mistral AI.
55.5K Pulls 17 Tags Updated 1 year ago
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
47.3K Pulls 17 Tags Updated 1 year ago
🎩 Magicoder is a family of 7B parameter models trained on 75K synthetic instruction examples using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
45.3K Pulls 18 Tags Updated 1 year ago
A 7B chat model fine-tuned with high-quality data and based on Zephyr.
34.9K Pulls 18 Tags Updated 1 year ago
A general-purpose model family ranging from 3 billion to 70 billion parameters, suitable for entry-level hardware.
752.5K Pulls 119 Tags Updated 1 year ago
Dolphin 2.9 is a new model by Eric Hartford, based on Llama 3 and available in 8B and 70B sizes, with a variety of instruction, conversational, and coding skills.
454.7K Pulls 53 Tags Updated 1 year ago
Supports translation between English, French, Chinese (Mandarin), and Japanese.
4,579 Pulls 1 Tag Updated 1 year ago
gemma-2-2b-jpn-it-translate is a model tuned for translation tasks based on google/gemma-2-2b-jpn-it released by Google.
935 Pulls 1 Tag Updated 11 months ago
This model is based on google/gemma-2-2b-jpn-it, enhanced with multiple tuning techniques to improve its general performance.
931 Pulls 1 Tag Updated 11 months ago