DeepSeek-R1-Pruned-Coder-411B is a pruned version of the DeepSeek-R1 reduced from 256 experts to 160 experts, The pruned model is mainly used for code generation.

411b

17 11 days ago

3 Tags
014981c453d8 • 257GB • 11 days ago
014981c453d8 • 257GB • 11 days ago
014981c453d8 • 257GB • 11 days ago