DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models such as o3 and Gemini 2.5 Pro.
74.5M Pulls 35 Tags Updated 5 months ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
207.9K Pulls 8 Tags Updated 2 months ago
This is a modified model that adds support for autonomous coding agents such as Cline.
556.1K Pulls 6 Tags Updated 9 months ago
DeepSeek's first generation of reasoning models, with performance comparable to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen. With tool calling support.
26.4K Pulls 26 Tags Updated 10 months ago
This version of DeepSeek R1 is optimized for tool usage with Cline and Roo Code.
17.1K Pulls 510 Tags Updated 10 months ago
Tool calling for deepseek-r1, tweaked for the goose agent
5,227 Pulls 2 Tags Updated 10 months ago
Many quantized GGUF versions of DeepSeek R1, abliterated (uncensored), with tool support.
4,614 Pulls 8 Tags Updated 10 months ago
Fused model adapted for Cline / Roo Code tool use in VS Code: a hybrid of DeepSeek-R1 and Qwen2.5-Coder, from FuseAI/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview.
4,302 Pulls 2 Tags Updated 10 months ago
A strong, economical, and efficient Mixture-of-Experts language model with Tool Calling support.
3,422 Pulls 3 Tags Updated 11 months ago
Adapted to work with Cline (Claude Dev) in Visual Studio Code.
3,012 Pulls 1 Tag Updated 10 months ago
2,012 Pulls 6 Tags Updated 9 months ago
DeepSeek R1 0528 Qwen3 8B with tool calling/MCP support
1,908 Pulls 1 Tag Updated 5 months ago
DeepSeek R1 optimized for tool usage with Cline.
1,661 Pulls 3 Tags Updated 9 months ago
Quantized version of DeepSeek-R1-32B optimized for tool usage with Cline / Roo Code and complex problem solving.
1,490 Pulls 1 Tag Updated 7 months ago
Tiny-R1-32B-Preview outperforms the 70B model DeepSeek-R1-Distill-Llama-70B and nearly matches the full R1 model in math.
1,321 Pulls 6 Tags Updated 9 months ago
DeepSeek R1 0528 Qwen3 8B Q4 with tool calling
1,128 Pulls 1 Tag Updated 6 months ago
902 Pulls 2 Tags Updated 10 months ago
This model is a distilled version of Qwen/Qwen3-30B-A3B-Instruct designed to inherit the reasoning and behavioral characteristics of its much larger teacher model, deepseek-ai/DeepSeek-V3.1.
859 Pulls 2 Tags Updated 3 months ago
This model has been developed based on DistilQwen2.5-DS3-0324-Series.
821 Pulls 7 Tags Updated 7 months ago
ollama run deepseek-v3
805 Pulls 1 Tag Updated 10 months ago