DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.
2.2M Pulls 1 Tag Updated 5 months ago
DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.
94.9K Pulls 1 Tag Updated 1 month ago
DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.
87.1K Pulls 1 Tag Updated 1 month ago
DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.
689.8K Pulls 8 Tags Updated 8 months ago
DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
86.5M Pulls 35 Tags Updated 11 months ago
pull from hf.co/Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash-GGUF
2,119 Pulls 1 Tag Updated 1 week ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4. + vision. ollama v.0.30.0-rc20 +
1,342 Pulls 1 Tag Updated 1 week ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4.
1,197 Pulls 1 Tag Updated 2 weeks ago
DeepSeek-R1-0528-Qwen3-8B
237 Pulls 1 Tag Updated 3 weeks ago
90 Pulls 1 Tag Updated 1 week ago
I have just enabled both calling and thinking to existing deepseek-r1 models.
1,358 Pulls 6 Tags Updated 1 month ago
3,261 Pulls 1 Tag Updated 5 months ago
ollama run f0rc3ps/deepseek-r1-32b-uncensored:nu11secur1ty
990 Pulls 1 Tag Updated 2 months ago
1,099 Pulls 1 Tag Updated 4 months ago
144 Pulls 1 Tag Updated 1 month ago
145 Pulls 1 Tag Updated 1 month ago
Based on DeepSeek R1 because OpenCode tries to verify on the registry for tool compatibility
201 Pulls 1 Tag Updated 2 months ago
This is a brand new Mixture of Export (MoE) model from DeepSeek, specializing in coding instructions. (quantized IQ4_XS)
14.5K Pulls 3 Tags Updated 4 months ago
42 Pulls 1 Tag Updated 1 month ago
This model is a distilled version of Qwen/Qwen3-30B-A3B-Instruct designed to inherit the reasoning and behavioral characteristics of its much larger teacher model, deepseek-ai/DeepSeek-V3.1.
1,961 Pulls 2 Tags Updated 8 months ago