DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
74.2M Pulls 35 Tags Updated 5 months ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
840.2K Pulls 5 Tags Updated 10 months ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
627.3K Pulls 15 Tags Updated 8 months ago
A version of the DeepSeek-R1 model that has been post trained to provide unbiased, accurate, and factual information by Perplexity.
152.4K Pulls 9 Tags Updated 9 months ago
DeepSeek's first generation reasoning models with comparable performance to OpenAI-o1.
606.8K Pulls 55 Tags Updated 6 months ago
This is a modified model that adds support for autonomous coding agents like Cline
556K Pulls 6 Tags Updated 9 months ago
Unsloth's DeepSeek-R1 , I just merged the thing and uploaded it here. This is the full 671b model. MoE Bits:1.58bit Type:UD-IQ1_S Disk Size:131GB Accuracy:Fair Details:MoE all 1.56bit. down_proj in MoE mixture of 2.06/1.56bit
170.9K Pulls 2 Tags Updated 10 months ago
DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers. Please use our setting to run these models.
119K Pulls 2 Tags Updated 10 months ago
Unsloth's DeepSeek-R1 1.58-bit, I just merged the thing and uploaded it here. This is the full 671b model, albeit dynamically quantized to 1.58bits.
101.4K Pulls 1 Tag Updated 10 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.51bit dynamic quant
60.5K Pulls 1 Tag Updated 10 months ago
29K Pulls 4 Tags Updated 7 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 1.73bit dynamic quant
26.7K Pulls 1 Tag Updated 10 months ago
DeepSeek's first-generation of reasoning models with comparable performance to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen. With Tool Calling support.
26.2K Pulls 26 Tags Updated 10 months ago
This version of Deepseek R1 is optimized for tool usage with Cline and Roo Code.
17.1K Pulls 510 Tags Updated 10 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.22bit dynamic quant
5,867 Pulls 1 Tag Updated 10 months ago
Tool calling for deepseek-r1, tweaked for the goose agent
5,202 Pulls 2 Tags Updated 10 months ago
Deepseek R1 with the Claude 3.7 Sonnet system prompt. Inspired by incept5/llama3.1-claude
5,002 Pulls 1 Tag Updated 9 months ago
Many quantized GGUF versions of deepseek R1 abliterated (uncensored) with tools support
4,583 Pulls 8 Tags Updated 10 months ago
Unsloth's DeepSeek-R1 , I just merged the thing and uploaded it here. This is the full 671b model. MoE Bits:1.73bit Type:UD-IQ1_M Disk Size:158GB Accuracy:Good Details:MoE all 1.56bit. down_proj in MoE left at 2.06bit
4,296 Pulls 2 Tags Updated 10 months ago
DeepSeek-R1-Distill-Qwen-1.5B
3,509 Pulls 1 Tag Updated 10 months ago