DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading closed models such as OpenAI o3 and Gemini 2.5 Pro.
74.5M Pulls 35 Tags Updated 5 months ago
A fine-tuned version of DeepSeek-R1-Distill-Qwen-1.5B that surpasses the performance of OpenAI's o1-preview on popular math evaluations with just 1.5B parameters.
869.6K Pulls 5 Tags Updated 10 months ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
633.1K Pulls 15 Tags Updated 8 months ago
A version of the DeepSeek-R1 model that has been post-trained by Perplexity to provide unbiased, accurate, and factual information.
156.6K Pulls 9 Tags Updated 10 months ago
This version of Deepseek R1 is optimized for tool usage with Cline and Roo Code.
17.1K Pulls 510 Tags Updated 10 months ago
Deepseek R1 with the Claude 3.7 Sonnet system prompt. Inspired by incept5/llama3.1-claude.
5,018 Pulls 1 Tag Updated 9 months ago
Deepseek R1 optimized for tool usage with Cline.
1,661 Pulls 3 Tags Updated 9 months ago
Tiny-R1-32B-Preview outperforms the 70B model Deepseek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
1,321 Pulls 6 Tags Updated 9 months ago
1,021 Pulls 3 Tags Updated 10 months ago
Deepseek R1 with the Claude 3.5 Sonnet system prompt. Inspired by incept5/llama3.1-claude.
794 Pulls 1 Tag Updated 10 months ago
(Mostly) Uncensored Deepseek R1 based on unsloth/deepseek-r1-distill-qwen-7b-unsloth-bnb-4bit.
604 Pulls 1 Tag Updated 10 months ago
An ablated version of Deepseek R1, from mradermacher's GGUF.
461 Pulls 1 Tag Updated 10 months ago
Qihoo 360's first-generation reasoning model, Tiny-R1-32B-Preview, outperforms the 70B model Deepseek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
166 Pulls 1 Tag Updated 9 months ago
DeepCoder-14B-Preview, a code reasoning model fine-tuned from DeepSeek-R1-Distill-Qwen-14B via distributed RL.
90 Pulls 1 Tag Updated 7 months ago
21 Pulls 1 Tag Updated 10 months ago
Unsloth's DeepSeek-R1, merged and uploaded here. This is the full 671B model. MoE bits: 1.58-bit; type: UD-IQ1_S; disk size: 131 GB; accuracy: fair. Details: all MoE layers at 1.56-bit, with down_proj in the MoE a mixture of 2.06/1.56-bit.
170.9K Pulls 2 Tags Updated 10 months ago
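As a rough sanity check on the quoted figures, the effective bits-per-weight can be back-computed from the 131 GB disk size and DeepSeek-R1's 671B parameter count. This is a sketch only; it assumes decimal gigabytes and spreads the size over all parameters, ignoring tensors kept at higher precision:

```python
# Back-of-envelope: effective bits per weight for the dynamic quant.
# Assumptions: 131 GB is decimal gigabytes; all 671B parameters counted.
disk_bytes = 131e9   # quoted disk size
n_params = 671e9     # DeepSeek-R1 total parameter count
bits_per_weight = disk_bytes * 8 / n_params
print(f"{bits_per_weight:.2f} bits/weight")  # ≈ 1.56
```

The result (about 1.56 bits/weight) lines up with the "MoE all 1.56bit" detail above, with the small remainder accounted for by the higher-precision down_proj and non-expert tensors.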
DeepSeek-R1-Distill models are fine-tuned from open-source base models using samples generated by DeepSeek-R1. We made slight changes to their configs and tokenizers; please use our settings to run these models.
125.7K Pulls 2 Tags Updated 11 months ago
Unsloth's DeepSeek-R1 1.58-bit, merged and uploaded here. This is the full 671B model, dynamically quantized to 1.58 bits.
101.4K Pulls 1 Tag Updated 10 months ago
A merged GGUF of Unsloth's DeepSeek-R1 671B 2.51-bit dynamic quant.
60.5K Pulls 1 Tag Updated 10 months ago
29.2K Pulls 4 Tags Updated 7 months ago