DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.
68.4K Pulls 1 Tag Updated 3 months ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
543.2K Pulls 8 Tags Updated 6 months ago
DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
81.8M Pulls 35 Tags Updated 9 months ago
ollama run f0rc3ps/deepseek-r1-32b-uncensored:nu11secur1ty
405 Pulls 1 Tag Updated 1 week ago
Based on DeepSeek R1 because OpenCode tries to verify on the registry for tool compatibility
130 Pulls 1 Tag Updated 3 weeks ago
3,049 Pulls 1 Tag Updated 3 months ago
783 Pulls 1 Tag Updated 2 months ago
This is a brand new Mixture of Export (MoE) model from DeepSeek, specializing in coding instructions. (quantized IQ4_XS)
6,729 Pulls 3 Tags Updated 2 months ago
DeepSeek R1 0528 Qwen3 8B with tool calling/MCP support
2,671 Pulls 1 Tag Updated 8 months ago
This model is a distilled version of Qwen/Qwen3-30B-A3B-Instruct designed to inherit the reasoning and behavioral characteristics of its much larger teacher model, deepseek-ai/DeepSeek-V3.1.
1,801 Pulls 2 Tags Updated 6 months ago
DeepSeek R1 0528 Qwen3 8B Q4 with tool calling
1,758 Pulls 1 Tag Updated 10 months ago
Quantized version of DeepSeek-R1-32B optimized for tool usage with Cline / Roo Code and complex problem solving.
1,675 Pulls 1 Tag Updated 11 months ago
deepseek
28 Pulls 1 Tag Updated 5 months ago
DeepSeek-Coder-V2-Lite-Instruct.Q6_K
869 Pulls 1 Tag Updated 1 month ago
This model has been developed based on DistilQwen2.5-DS3-0324-Series.
1,029 Pulls 7 Tags Updated 11 months ago
16k Context Window meaning you need less RAM to run this. It's full context windows is loaded in the deepseekq3_coder. It allocates the RAM needed for the context when loading the model.
483 Pulls 1 Tag Updated 8 months ago
Added system prompt to deepseek's new 8B model with Qwen 3, potentially could help, also kept context large as well as temp strict.
448 Pulls 1 Tag Updated 8 months ago
384 Pulls 1 Tag Updated 8 months ago
187 Pulls 2 Tags Updated 8 months ago
This is not the ablation version. DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode.
175 Pulls 3 Tags Updated 7 months ago