DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as OpenAI o3 and Gemini 2.5 Pro.
74.5M Pulls 35 Tags Updated 5 months ago
A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.
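A quick way to see the sparsity these figures imply: dividing the activated parameters by the total gives the fraction of the model that runs per token. This is a minimal arithmetic sketch using only the 671B/37B numbers from the description above.

```python
# MoE sparsity implied by the listing above:
# 671B total parameters, 37B activated per token.
total_params = 671e9
active_params = 37e9

active_fraction = active_params / total_params
print(f"Active per token: {active_fraction:.1%}")  # roughly 5.5%
```

In other words, each token touches only about one-eighteenth of the model's weights, which is what makes a 671B-parameter MoE economical to serve.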
3M Pulls 5 Tags Updated 11 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
2.3M Pulls 102 Tags Updated 1 year ago
An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT-4 Turbo in code-specific tasks.
1.3M Pulls 64 Tags Updated 1 year ago
An advanced language model trained on 2 trillion bilingual tokens.
237.6K Pulls 64 Tags Updated 2 years ago
A strong, economical, and efficient Mixture-of-Experts language model.
227.2K Pulls 34 Tags Updated 1 year ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
207.9K Pulls 8 Tags Updated 2 months ago
An upgraded version of DeepSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
92.4K Pulls 7 Tags Updated 1 year ago
DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.
65.5K Pulls 3 Tags Updated 4 weeks ago
DeepSeek-V3.2 is a model that harmonizes high computational efficiency with superior reasoning and agent performance.
4,811 Pulls 1 Tag Updated 1 week ago
A fine-tuned version of DeepSeek-R1-Distill-Qwen-1.5B that surpasses the performance of OpenAI's o1-preview on popular math evaluations with just 1.5B parameters.
870.6K Pulls 5 Tags Updated 10 months ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
633.3K Pulls 15 Tags Updated 8 months ago
A version of the DeepSeek-R1 model post-trained by Perplexity to provide unbiased, accurate, and factual information.
156.7K Pulls 9 Tags Updated 10 months ago
This version of DeepSeek-R1 is optimized for tool usage with Cline and Roo Code.
17.1K Pulls 510 Tags Updated 10 months ago
DeepSeek-R1 with the Claude 3.7 Sonnet system prompt. Inspired by incept5/llama3.1-claude.
5,018 Pulls 1 Tag Updated 9 months ago
DeepSeek-R1 optimized for tool usage with Cline.
1,661 Pulls 3 Tags Updated 9 months ago
Tiny-R1-32B-Preview outperforms the 70B model DeepSeek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
1,321 Pulls 6 Tags Updated 9 months ago
A distilled version of Qwen/Qwen3-30B-A3B-Instruct designed to inherit the reasoning and behavioral characteristics of its much larger teacher model, deepseek-ai/DeepSeek-V3.1.
859 Pulls 2 Tags Updated 3 months ago