DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models such as o3 and Gemini 2.5 Pro.
73.7M Pulls 35 Tags Updated 5 months ago
A fine-tuned version of DeepSeek-R1-Distill-Qwen-1.5B that surpasses OpenAI’s o1-preview on popular math evaluations with just 1.5B parameters.
797.5K Pulls 5 Tags Updated 9 months ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
621K Pulls 15 Tags Updated 8 months ago
A version of the DeepSeek-R1 model that has been post-trained by Perplexity to provide unbiased, accurate, and factual information.
146.8K Pulls 9 Tags Updated 9 months ago
DeepSeek's first-generation reasoning models, with performance comparable to OpenAI-o1.
603.6K Pulls 55 Tags Updated 6 months ago
This is a modified model that adds support for autonomous coding agents like Cline.
555.9K Pulls 6 Tags Updated 9 months ago
Unsloth's DeepSeek-R1; I just merged the thing and uploaded it here. This is the full 671B model. MoE Bits: 1.58bit. Type: UD-IQ1_S. Disk Size: 131GB. Accuracy: Fair. Details: MoE all 1.56bit; down_proj in MoE is a mixture of 2.06/1.56bit.
170.9K Pulls 2 Tags Updated 10 months ago
DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers. Please use our setting to run these models.
109.7K Pulls 2 Tags Updated 10 months ago
Unsloth's DeepSeek-R1 1.58-bit; I just merged the thing and uploaded it here. This is the full 671B model, albeit dynamically quantized to 1.58 bits.
101.4K Pulls 1 Tag Updated 10 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.51bit dynamic quant
60.5K Pulls 1 Tag Updated 10 months ago
28.7K Pulls 4 Tags Updated 7 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 1.73bit dynamic quant
26.7K Pulls 1 Tag Updated 10 months ago
DeepSeek's first generation of reasoning models, with performance comparable to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen. With tool calling support.
26K Pulls 26 Tags Updated 10 months ago
This version of DeepSeek-R1 is optimized for tool usage with Cline and Roo Code.
17K Pulls 510 Tags Updated 9 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.22bit dynamic quant
5,867 Pulls 1 Tag Updated 10 months ago
Tool calling for deepseek-r1, tweaked for the goose agent.
5,180 Pulls 2 Tags Updated 10 months ago
DeepSeek-R1 with the Claude 3.7 Sonnet system prompt. Inspired by incept5/llama3.1-claude.
4,984 Pulls 1 Tag Updated 9 months ago
Many quantized GGUF versions of DeepSeek-R1, abliterated (uncensored), with tool support.
4,551 Pulls 8 Tags Updated 10 months ago
Unsloth's DeepSeek-R1; I just merged the thing and uploaded it here. This is the full 671B model. MoE Bits: 1.73bit. Type: UD-IQ1_M. Disk Size: 158GB. Accuracy: Good. Details: MoE all 1.56bit; down_proj in MoE left at 2.06bit.
4,296 Pulls 2 Tags Updated 10 months ago
DeepSeek-R1-Distill-Qwen-1.5B
3,463 Pulls 1 Tag Updated 9 months ago