DeepSeek's first-generation of reasoning models with comparable performance to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen.
41.1M Pulls 29 Tags Updated 2 months ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
522.8K Pulls 15 Tags Updated 3 weeks ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
77.7K Pulls 5 Tags Updated 2 months ago
A version of the DeepSeek-R1 model that has been post trained to provide unbiased, accurate, and factual information by Perplexity.
27.4K Pulls 9 Tags Updated 2 months ago
This is a modified model that adds support for autonomous coding agents like Cline
550.4K Pulls 6 Tags Updated 2 months ago
DeepSeek's first generation reasoning models with comparable performance to OpenAI-o1.
460.8K Pulls 51 Tags Updated 2 weeks ago
Unsloth's DeepSeek-R1 , I just merged the thing and uploaded it here. This is the full 671b model. MoE Bits:1.58bit Type:UD-IQ1_S Disk Size:131GB Accuracy:Fair Details:MoE all 1.56bit. down_proj in MoE mixture of 2.06/1.56bit
170.7K Pulls 2 Tags Updated 2 months ago
Unsloth's DeepSeek-R1 1.58-bit, I just merged the thing and uploaded it here. This is the full 671b model, albeit dynamically quantized to 1.58bits.
100.3K Pulls 1 Tag Updated 3 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.51bit dynamic quant
60K Pulls 1 Tag Updated 3 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 1.73bit dynamic quant
26.6K Pulls 1 Tag Updated 3 months ago
19.8K Pulls 1 Tag Updated 3 months ago
DeepSeek's first-generation of reasoning models with comparable performance to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen. With Tool Calling support.
17.4K Pulls 26 Tags Updated 2 months ago
deepseek-r1:32b first generation reasoning models with comparable performance to OpenAI-o1.
15.2K Pulls 1 Tag Updated 3 months ago
This version of Deepseek R1 is optimized for tool usage with Cline and Roo Code.
11.7K Pulls 510 Tags Updated 2 months ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.22bit dynamic quant
5,814 Pulls 1 Tag Updated 3 months ago
DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers. Please use our setting to run these models.
4,590 Pulls 2 Tags Updated 3 months ago
Unsloth's DeepSeek-R1 , I just merged the thing and uploaded it here. This is the full 671b model. MoE Bits:1.73bit Type:UD-IQ1_M Disk Size:158GB Accuracy:Good Details:MoE all 1.56bit. down_proj in MoE left at 2.06bit
4,267 Pulls 2 Tags Updated 2 months ago
Tool calling for deepseek-r1, tweaked for the goose agent
3,668 Pulls 2 Tags Updated 3 months ago
Many quantized GGUF versions of deepseek R1 abliterated (uncensored) with tools support
2,618 Pulls 8 Tags Updated 2 months ago