An upgraded version of DeepSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
277.5K Pulls 7 Tags Updated 1 year ago
DeepSeek-V2.5-1210 is an upgraded version of DeepSeek-V2.5, offering enhanced mathematical, coding, writing, and reasoning capabilities.
388 Pulls 3 Tags Updated 1 year ago
(Unsloth Dynamic Quants) DeepSeek-V2.5-1210 is an upgraded version of DeepSeek-V2.5, offering enhanced mathematical, coding, writing, and reasoning capabilities.
192 Pulls 3 Tags Updated 1 year ago
4 Pulls 1 Tag Updated 1 year ago
DeepSeek-V3-Pruned-Coder-411B is a pruned version of DeepSeek-V3, reduced from 256 experts to 160 experts. The pruned model is mainly used for code generation.
1,390 Pulls 5 Tags Updated 1 year ago
DeepSeek-R1-Pruned-Coder-411B is a pruned version of DeepSeek-R1, reduced from 256 experts to 160 experts. The pruned model is mainly used for code generation.
92 Pulls 3 Tags Updated 1 year ago
This model has been developed based on DistilQwen2.5-DS3-0324-Series.
1,223 Pulls 7 Tags Updated 1 year ago
DeepSeek V3 from March 2025, merged from Unsloth's HF - 671B params - Q8_0/713 GB & Q4_K_M/404 GB
959 Pulls 4 Tags Updated 1 year ago
Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash is an efficient reasoning model distilled using high-quality data from DeepSeek-V4, with vision support. ollama v0.30.0-rc20+
3,303 Pulls 1 Tag Updated 4 weeks ago