DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
81.8M Pulls 35 Tags Updated 9 months ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
1.2M Pulls 5 Tags Updated 1 year ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
1M Pulls 15 Tags Updated 12 months ago
A version of the DeepSeek-R1 model that has been post trained to provide unbiased, accurate, and factual information by Perplexity.
360K Pulls 9 Tags Updated 1 year ago
DeepCoder-14B-Preview, a code reasoning model finetuned from Deepseek-R1-Distilled-Qwen-14B via distributed RL
126 Pulls 1 Tag Updated 11 months ago
This version of Deepseek R1 is optimized for tool usage with Cline and Roo Code.
18.6K Pulls 510 Tags Updated 1 year ago
Deepseek R1 with the Claude 3.7 Sonnet system prompt. Inspired by incept5/llama3.1-claude
5,435 Pulls 1 Tag Updated 1 year ago
Deepseek R1 optimized for tool usage with Cline.
1,861 Pulls 3 Tags Updated 1 year ago
1,857 Pulls 3 Tags Updated 1 year ago
Tiny-R1-32B-Preview, which outperforms the 70B model Deepseek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
1,557 Pulls 6 Tags Updated 1 year ago
Deepseek R1 with the Claude 3.5 Sonnet system prompt. Inspired by incept5/llama3.1-claude
850 Pulls 1 Tag Updated 1 year ago
(Mostly) Uncensored Deepseek R1 based on unsloth/deepseek-r1-distill-qwen-7b-unsloth-bnb-4bit.
680 Pulls 1 Tag Updated 1 year ago
Deepseek R1 ablated version from mradermacher's gguf
486 Pulls 1 Tag Updated 1 year ago
Qihoo 360's first-generation reasoning model, Tiny-R1-32B-Preview, which outperforms the 70B model Deepseek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
196 Pulls 1 Tag Updated 1 year ago
22 Pulls 1 Tag Updated 1 year ago
Based on DeepSeek R1 because OpenCode tries to verify on the registry for tool compatibility
130 Pulls 1 Tag Updated 3 weeks ago
Huggingface link - https://huggingface.co/iradukunda-dev/law-finetuned-DeepSeek-R1-Distill-Qwen-7B
273 Pulls 1 Tag Updated 2 months ago
基于 DeepSeek-R1-Distill-Qwen-1.5B 微调的中文轻量对话模型,自带猫娘口癖与亲昵风格。
295 Pulls 1 Tag Updated 5 months ago
SmallCoder is a compact reasoning-focused coding model, fine-tuned from DeepSeek-R1 1.5B using a code dataset that includes step-by-step reasoning.
148 Pulls 1 Tag Updated 1 month ago
Deepseek-r1
142 Pulls 1 Tag Updated 2 months ago