24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.
39.5K Pulls 6 Tags Updated 5 days ago
123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.
9,172 Pulls 6 Tags Updated 6 days ago
DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
74.5M Pulls 35 Tags Updated 5 months ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.5M Pulls 5 Tags Updated 11 months ago
A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.
3M Pulls 5 Tags Updated 11 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
2.3M Pulls 102 Tags Updated 1 year ago
An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.
1.3M Pulls 64 Tags Updated 1 year ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
867.2K Pulls 5 Tags Updated 10 months ago
2.7B uncensored Dolphin model by Eric Hartford, based on the Phi language model by Microsoft Research.
851.8K Pulls 15 Tags Updated 1 year ago
Uncensored, 8x7b and 8x22b fine-tuned models based on the Mixtral mixture of experts models that excels at coding tasks. Created by Eric Hartford.
774.8K Pulls 70 Tags Updated 12 months ago
Dolphin 2.9 is a new model with 8B and 70B sizes by Eric Hartford based on Llama 3 that has a variety of instruction, conversational, and coding skills.
637.8K Pulls 53 Tags Updated 1 year ago
Devstral: the best open source model for coding agents
548.2K Pulls 5 Tags Updated 5 months ago
The uncensored Dolphin model based on Mistral that excels at coding tasks. Updated to version 2.8.
460.6K Pulls 120 Tags Updated 1 year ago
DeepCoder is a fully open-Source 14B coder model at O3-mini level, with a 1.5B version also available.
455.6K Pulls 9 Tags Updated 8 months ago
An advanced language model crafted with 2 trillion bilingual tokens.
237.4K Pulls 64 Tags Updated 2 years ago
A strong, economical, and efficient Mixture-of-Experts language model.
226.9K Pulls 34 Tags Updated 1 year ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
207K Pulls 8 Tags Updated 2 months ago
A 7B and 15B uncensored variant of the Dolphin model family that excels at coding, based on StarCoder2.
141K Pulls 35 Tags Updated 1 year ago
DBRX is an open, general-purpose LLM created by Databricks.
132.5K Pulls 7 Tags Updated 1 year ago
An upgraded version of DeekSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
92.2K Pulls 7 Tags Updated 1 year ago