DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.
2.2M Pulls 1 Tag Updated 6 months ago
48 Pulls 1 Tag Updated 1 month ago
3,291 Pulls 1 Tag Updated 6 months ago
DeepSeek-V3-Pruned-Coder-411B is a pruned version of DeepSeek-V3, reduced from 256 experts to 160 experts. The pruned model is mainly used for code generation.
1,390 Pulls 5 Tags Updated 1 year ago
DeepSeek V3 from March 2025, merged from Unsloth's HF - 671B params - Q8_0/713 GB & Q4_K_M/404 GB
959 Pulls 4 Tags Updated 1 year ago
Quantized version of DeepSeek-R1-32B optimized for tool usage with Cline / Roo Code and complex problem solving.
1,759 Pulls 1 Tag Updated 1 year ago
Merged Unsloth's Dynamic Quantization
1,382 Pulls 1 Tag Updated 1 year ago
This model has been developed based on DistilQwen2.5-DS3-0324-Series.
1,223 Pulls 7 Tags Updated 1 year ago
deepseek-v3-0324-Quants. - Q2_K is the lowest here - quantized = round((original - zero_point) / scale)
1,131 Pulls 1 Tag Updated 1 year ago
dynamic quants from unsloth, merged
294 Pulls 1 Tag Updated 1 year ago
Latest DeepSeek_V3 model Q4
249 Pulls 1 Tag Updated 1 year ago
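
The deepseek-v3-0324-Quants entry above quotes the formula quantized = round((original - zero_point) / scale). As a rough illustration of what that formula does, here is a minimal Python sketch. It assumes zero_point is the minimum of the input values and scale spreads the value range evenly over the available integer codes. The function names and the bits parameter are hypothetical, and this is not the actual Q2_K block format used by GGUF quantizers, which is more involved.

```python
def quantize(values, bits=2):
    """Map floats to integer codes in [0, 2**bits - 1] using the quoted formula.

    Assumption: zero_point is the minimum input value and scale is the
    value range divided by the highest integer code.
    """
    levels = (1 << bits) - 1              # highest integer code, e.g. 3 for 2 bits
    zero_point = min(values)
    span = max(values) - zero_point
    scale = span / levels if span > 0 else 1.0   # avoid dividing by zero on constant input
    codes = [round((v - zero_point) / scale) for v in values]
    return codes, scale, zero_point


def dequantize(codes, scale, zero_point):
    """Approximately invert quantize: original ~= code * scale + zero_point."""
    return [c * scale + zero_point for c in codes]


if __name__ == "__main__":
    weights = [0.0, 1.0, 2.0, 3.0]
    codes, scale, zero_point = quantize(weights, bits=2)
    print(codes, scale, zero_point)
    print(dequantize(codes, scale, zero_point))
```

Fewer bits mean fewer distinct codes, so more inputs collapse onto the same code and the reconstruction error grows. That trade-off is why lower-bit variants such as Q2_K are smaller but less precise than Q4_K_M or Q8_0.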