DeepSeek-V3-Pruned-Coder-411B is a pruned version of DeepSeek-V3, reduced from 256 experts to 160 experts. The pruned model is mainly used for code generation.
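To put the pruning in proportion, the short sketch below works out what share of experts is kept. It uses only the two expert counts from the description; the function name `retained_fraction` is a hypothetical helper for illustration, not part of any tooling for this model.

```python
# Expert counts taken from the model description.
ORIGINAL_EXPERTS = 256
PRUNED_EXPERTS = 160


def retained_fraction(original: int, pruned: int) -> float:
    """Return the share of experts kept after pruning (hypothetical helper)."""
    if original <= 0 or not 0 <= pruned <= original:
        raise ValueError("expected 0 <= pruned <= original and original > 0")
    return pruned / original


kept = retained_fraction(ORIGINAL_EXPERTS, PRUNED_EXPERTS)
removed = ORIGINAL_EXPERTS - PRUNED_EXPERTS

# Prints: kept 62.5% of experts, removed 96
print(f"kept {kept:.1%} of experts, removed {removed}")
```

This counts experts only. It does not say how total parameter count or code-generation quality changes, since neither is stated in the description.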
411b
74 Pulls Updated 2 weeks ago
3 Tags
dc92a7c65133 • 257GB • 2 weeks ago
dc92a7c65133 • 257GB • 2 weeks ago
dc92a7c65133 • 257GB • 2 weeks ago