Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.5M Pulls 5 Tags Updated 11 months ago
Llama 3.2 Vision is a collection of instruction-tuned image reasoning generative models in 11B and 90B sizes.
3.3M Pulls 9 Tags Updated 7 months ago
A LLaVA model fine-tuned from Llama 3 Instruct with better scores in several benchmarks.
2.1M Pulls 4 Tags Updated 1 year ago
An update to Mistral Small that improves function calling and instruction following, and reduces repetition errors.
892.1K Pulls 5 Tags Updated 6 months ago
IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.
773.5K Pulls 3 Tags Updated 8 months ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
638.5K Pulls 53 Tags Updated 1 year ago
Granite 4 models feature improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.
296K Pulls 17 Tags Updated 1 month ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
181.2K Pulls 36 Tags Updated 1 year ago
EXAONE 3.5 is a collection of instruction-tuned bilingual (English and Korean) generative models ranging from 2.4B to 32B parameters, developed and released by LG AI Research.
146.8K Pulls 13 Tags Updated 1 year ago
Tülu 3 is a leading instruction-following model family, offering fully open-source data, code, and recipes by the Allen Institute for AI.
129.4K Pulls 9 Tags Updated 12 months ago
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.
120.2K Pulls 17 Tags Updated 1 year ago
A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.
107.8K Pulls 5 Tags Updated 11 months ago
An upgraded version of DeepSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
92.3K Pulls 7 Tags Updated 1 year ago
ShieldGemma is a set of instruction-tuned models for evaluating the safety of text prompt input and text output responses against a set of defined safety policies.
83.8K Pulls 49 Tags Updated 1 year ago
Nexus Raven is a 13B instruction tuned model for function calling tasks.
77.2K Pulls 32 Tags Updated 1 year ago
🎩 Magicoder is a family of 7B parameter models trained on 75K synthetic instruction data using OSS-Instruct, a novel approach to enlightening LLMs with open-source code snippets.
68.4K Pulls 18 Tags Updated 2 years ago
A high-performing code instruct model created by merging two existing code models.
68K Pulls 16 Tags Updated 2 years ago
A robust conversational model designed to be used for both chat and instruct use cases.
50.5K Pulls 7 Tags Updated 2 years ago
A state-of-the-art mixture-of-experts (MoE) language model. Kimi K2-Instruct-0905 demonstrates significant improvements in performance on public benchmarks and real-world coding agent tasks.
21K Pulls 1 Tag Updated 2 months ago
The Cogito v2.1 LLMs are instruction-tuned generative models. All models are released under the MIT license for commercial use.
18.6K Pulls 6 Tags Updated 3 weeks ago