The 7B model released by Mistral AI, updated to version 0.3.
27M Pulls 84 Tags Updated 8 months ago
OLMo 2 is a new family of 7B and 13B models trained on up to 5T tokens. These models are on par with or better than equivalently sized fully open models, and competitive with open-weight models such as Llama 3.1 on English academic benchmarks.
3.6M Pulls 9 Tags Updated 1 year ago
Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
6M Pulls 102 Tags Updated 2 years ago
StarCoder2 is the next generation of transparently trained open code LLMs that comes in three sizes: 3B, 7B and 15B parameters.
2.4M Pulls 67 Tags Updated 1 year ago
InternLM2.5 is a 7B parameter model tailored for practical scenarios with outstanding reasoning capability.
597.8K Pulls 65 Tags Updated 1 year ago
OpenHermes 2.5 is a 7B model fine-tuned by Teknium on Mistral with fully open datasets.
686.9K Pulls 35 Tags Updated 2 years ago
A 7B and 15B uncensored variant of the Dolphin model family that excels at coding, based on StarCoder2.
609.9K Pulls 35 Tags Updated 1 year ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
653.1K Pulls 36 Tags Updated 1 year ago
Wizard Vicuna Uncensored is a 7B, 13B, and 30B parameter model based on Llama 2 uncensored by Eric Hartford.
787.2K Pulls 49 Tags Updated 2 years ago
BakLLaVA is a multimodal model consisting of the Mistral 7B base model augmented with the LLaVA architecture.
654.2K Pulls 17 Tags Updated 2 years ago
MathΣtral: a 7B model designed for math reasoning and scientific discovery by Mistral AI.
344.5K Pulls 17 Tags Updated 1 year ago
Mistral OpenOrca is a 7 billion parameter model, fine-tuned on top of the Mistral 7B model using the OpenOrca dataset.
474.5K Pulls 17 Tags Updated 2 years ago
7B parameter text-to-SQL model made by MotherDuck and Numbers Station.
330.7K Pulls 17 Tags Updated 2 years ago
🎩 Magicoder is a family of 7B parameter models trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
344.5K Pulls 18 Tags Updated 2 years ago
A 7B chat model fine-tuned with high-quality data and based on Zephyr.
331.5K Pulls 18 Tags Updated 2 years ago
A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.
128.6K Pulls 1 Tag Updated 1 month ago
Q4_L_M version of PsychoCounsel-Llama3-8B
48 Pulls 1 Tag Updated 2 months ago
Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.
111.8M Pulls 93 Tags Updated 1 year ago
New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.
3.5M Pulls 14 Tags Updated 1 year ago
Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.
2.6M Pulls 21 Tags Updated 1 year ago