huihui_ai/
deepseek-v3-pruned:411b-coder-0324

1,244 5 months ago

DeepSeek-V3-Pruned-Coder-411B is a pruned version of the DeepSeek-V3 reduced from 256 experts to 160 experts, The pruned model is mainly used for code generation.

411b
d318c0731575 · 160B
{
"num_gpu": 1,
"stop": [
"<|begin▁of▁sentence|>",
"<|end▁of▁sentence|>",
"<|User|>",
"<|Assistant|>"
]
}