deepseek-v2.5

An upgraded version of DeekSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.

236b

277.5K Pulls 7 Tags Updated 1 year ago

milkey/deepseek-v2.5-1210

DeepSeek-V2.5-1210 is an upgraded version of DeepSeek-V2.5, offering enhanced mathematical, coding, writing, and reasoning capabilities.

388 Pulls 3 Tags Updated 1 year ago

milkey/deepseek-v2.5-1210-UD

(Unsloth Dynamic Quants) DeepSeek-V2.5-1210 is an upgraded version of DeepSeek-V2.5, offering enhanced mathematical, coding, writing, and reasoning capabilities.

192 Pulls 3 Tags Updated 1 year ago

yoshi_likes_e4/deepseek-v2.5

4 Pulls 1 Tag Updated 1 year ago

huihui_ai/deepseek-v3-pruned

DeepSeek-V3-Pruned-Coder-411B is a pruned version of the DeepSeek-V3 reduced from 256 experts to 160 experts, The pruned model is mainly used for code generation.

411b

1,390 Pulls 5 Tags Updated 1 year ago

huihui_ai/deepseek-r1-pruned

DeepSeek-R1-Pruned-Coder-411B is a pruned version of the DeepSeek-R1 reduced from 256 experts to 160 experts, The pruned model is mainly used for code generation.

411b

92 Pulls 3 Tags Updated 1 year ago

xiaowangge/deepseek-v3-qwen2.5

This model has been developed based on DistilQwen2.5-DS3-0324-Series.

tools 32b

1,223 Pulls 7 Tags Updated 1 year ago

lordoliver/DeepSeek-V3-0324

DeepSeep V3 from March 2025 Merged from Unsloth's HF - 671B params - Q8_0/713 GB & Q4_K_M/404 GB

671b

959 Pulls 4 Tags Updated 1 year ago

pdurugyan/qwen3.5-9b-deepseek-v4-flash-Q4_K_M-v_2

Jackrong/Qwen3.5-9B-DeepSeek-V4-Flash - is an efficient reasoning model distilled using high-quality data from DeepSeek-V4. + vision. ollama v.0.30.0-rc20 +

vision tools thinking

3,303 Pulls 1 Tag Updated 4 weeks ago