DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
81.8M Pulls 35 Tags Updated 9 months ago
DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.
68.3K Pulls 1 Tag Updated 3 months ago
DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.
379.8K Pulls 3 Tags Updated 4 months ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
1.2M Pulls 5 Tags Updated 1 year ago
A fully open-source family of reasoning models built using a dataset derived by distilling DeepSeek-R1.
1M Pulls 15 Tags Updated 12 months ago
A version of the DeepSeek-R1 model that has been post trained to provide unbiased, accurate, and factual information by Perplexity.
360K Pulls 9 Tags Updated 1 year ago
1 Pull 1 Tag Updated 4 months ago
Huggingface link - https://huggingface.co/iradukunda-dev/law-finetuned-DeepSeek-R1-Distill-Qwen-7B
273 Pulls 1 Tag Updated 2 months ago
DeepSeek R1 0528 Qwen3 8B with tool calling/MCP support
2,670 Pulls 1 Tag Updated 8 months ago
DeepSeek R1 0528 Qwen3 8B Q4 with tool calling
1,757 Pulls 1 Tag Updated 10 months ago
Quantized version of DeepSeek-R1-32B optimized for tool usage with Cline / Roo Code and complex problem solving.
1,675 Pulls 1 Tag Updated 11 months ago
DeepSeek-R1-0528 仍然使用 2024 年 12 月所发布的 DeepSeek V3 Base 模型作为基座,但在后训练过程中投入了更多算力,显著提升了模型的思维深度与推理能力。这个8B精馏版本编程能力都爆表!
982 Pulls 1 Tag Updated 10 months ago
16k Context Window meaning you need less RAM to run this. It's full context windows is loaded in the deepseekq3_coder. It allocates the RAM needed for the context when loading the model.
483 Pulls 1 Tag Updated 8 months ago
384 Pulls 1 Tag Updated 8 months ago
103 Pulls 1 Tag Updated 9 months ago
Unsloth's DeepSeek-R1 1.58-bit, I just merged the thing and uploaded it here. This is the full 671b model, albeit dynamically quantized to 1.58bits.
101.5K Pulls 1 Tag Updated 1 year ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.51bit dynamic quant
60.5K Pulls 1 Tag Updated 1 year ago
Merged GGUF Unsloth's DeepSeek-R1 671B 1.73bit dynamic quant
26.8K Pulls 1 Tag Updated 1 year ago
This version of Deepseek R1 is optimized for tool usage with Cline and Roo Code.
18.6K Pulls 510 Tags Updated 1 year ago
Merged GGUF Unsloth's DeepSeek-R1 671B 2.22bit dynamic quant
5,876 Pulls 1 Tag Updated 1 year ago