DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models such as o3 and Gemini 2.5 Pro.
67M Pulls 35 Tags Updated 3 months ago
A fine-tuned version of DeepSeek-R1-Distilled-Qwen-1.5B that, with just 1.5B parameters, surpasses OpenAI’s o1-preview on popular math evaluations.
531.1K Pulls 5 Tags Updated 8 months ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
603.4K Pulls 15 Tags Updated 6 months ago
A version of the DeepSeek-R1 model that has been post-trained by Perplexity to provide unbiased, accurate, and factual information.
119.5K Pulls 9 Tags Updated 8 months ago
DeepSeek's first-generation reasoning models, with performance comparable to OpenAI-o1.
580.6K Pulls 55 Tags Updated 4 months ago
A modified model that adds support for autonomous coding agents such as Cline.
555.1K Pulls 6 Tags Updated 7 months ago
Unsloth's DeepSeek-R1 1.58-bit dynamic quant, merged and uploaded here. This is the full 671B model, dynamically quantized to 1.58 bits.
101.2K Pulls 1 Tag Updated 8 months ago
Merged GGUF of Unsloth's DeepSeek-R1 671B 2.51-bit dynamic quant.
60.4K Pulls 1 Tag Updated 8 months ago
Merged GGUF of Unsloth's DeepSeek-R1 671B 1.73-bit dynamic quant.
26.7K Pulls 1 Tag Updated 8 months ago
DeepSeek's first generation of reasoning models, with performance comparable to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen. Includes tool-calling support.
24.8K Pulls 26 Tags Updated 8 months ago
This version of DeepSeek R1 is optimized for tool usage with Cline and Roo Code.
16.3K Pulls 510 Tags Updated 8 months ago
Merged GGUF of Unsloth's DeepSeek-R1 671B 2.22-bit dynamic quant.
5,864 Pulls 1 Tag Updated 8 months ago
Tool calling for deepseek-r1, tweaked for the goose agent
5,040 Pulls 2 Tags Updated 8 months ago
DeepSeek R1 with the Claude 3.7 Sonnet system prompt. Inspired by incept5/llama3.1-claude.
4,768 Pulls 1 Tag Updated 7 months ago
Additional training on Japanese data by CyberAgent for deepseek-r1.
2,918 Pulls 2 Tags Updated 8 months ago
Fixed uncensored deepseek-r1:8b and 14b
2,465 Pulls 2 Tags Updated 8 months ago
The mradermacher build of the Llama-distilled DeepSeek-R1 model, with abliteration applied.
2,382 Pulls 1 Tag Updated 9 months ago
DeepSeek R1 0528 Qwen3 8B with tool calling/MCP support
1,713 Pulls 1 Tag Updated 3 months ago
1,705 Pulls 6 Tags Updated 8 months ago
Quantized version of DeepSeek-R1-32B optimized for tool usage with Cline / Roo Code and complex problem solving.
1,376 Pulls 1 Tag Updated 5 months ago