d · Ollama

deepseek-v3.2

DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

2.2M Pulls 1 Tag Updated 5 months ago

devstral-small-2

24B model that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.

vision tools cloud 24b

847.2K Pulls 6 Tags Updated 5 months ago

deepseek-v4-flash

DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.

tools thinking cloud

95.1K Pulls 1 Tag Updated 1 month ago

deepseek-v4-pro

DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.

tools thinking cloud

87.2K Pulls 1 Tag Updated 1 month ago

devstral-2

123B model that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.

tools cloud 123b

228.5K Pulls 6 Tags Updated 5 months ago

deepseek-ocr

DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

vision 3b

453.2K Pulls 3 Tags Updated 6 months ago

deepseek-v3.1

DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

tools thinking cloud 671b

690K Pulls 8 Tags Updated 8 months ago

deepseek-r1

DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models such as o3 and Gemini 2.5 Pro.

tools thinking 1.5b 7b 8b 14b 32b 70b 671b

86.5M Pulls 35 Tags Updated 11 months ago

deepseek-v3

A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.

671b

3.8M Pulls 5 Tags Updated 1 year ago

dolphin3

Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.

8b

3.8M Pulls 5 Tags Updated 1 year ago

deepseek-coder

DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.

1.3b 6.7b 33b

4.2M Pulls 102 Tags Updated 2 years ago

deepseek-coder-v2

An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

16b 236b

2.6M Pulls 64 Tags Updated 1 year ago

devstral

Devstral: the best open source model for coding agents

tools 24b

949.1K Pulls 5 Tags Updated 10 months ago

deepscaler

A fine-tuned version of DeepSeek-R1-Distill-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.

1.5b

1.2M Pulls 5 Tags Updated 1 year ago

deepcoder

DeepCoder is a fully open-source 14B coder model at o3-mini level, with a 1.5B version also available.

1.5b 14b

872.4K Pulls 9 Tags Updated 1 year ago

dolphin-llama3

Dolphin 2.9 is a new model by Eric Hartford, based on Llama 3 and available in 8B and 70B sizes, with a variety of instruction, conversational, and coding skills.

8b 70b

1.9M Pulls 53 Tags Updated 2 years ago

dolphin-mixtral

Uncensored 8x7b and 8x22b fine-tuned models, based on the Mixtral mixture-of-experts models, that excel at coding tasks. Created by Eric Hartford.

8x7b 8x22b

1.8M Pulls 70 Tags Updated 1 year ago

dolphin-phi

2.7B uncensored Dolphin model by Eric Hartford, based on the Phi language model by Microsoft Research.

2.7b

1.6M Pulls 15 Tags Updated 2 years ago

dolphin-mistral

The uncensored Dolphin model based on Mistral that excels at coding tasks. Updated to version 2.8.

7b

1.5M Pulls 120 Tags Updated 2 years ago

deepseek-v2

A strong, economical, and efficient Mixture-of-Experts language model.

16b 236b

1.1M Pulls 34 Tags Updated 1 year ago