
Single-file version with Dynamic Quants. A strong Mixture-of-Experts (MoE) language model with 671B total parameters, of which 37B are activated for each token.
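To illustrate the "37B activated of 671B total" claim, the toy sketch below shows top-k expert routing, the mechanism by which an MoE layer evaluates only a few experts per token. All sizes and weights are made up for illustration; this is not DeepSeek's actual configuration or code.

import numpy as np

# Toy Mixture-of-Experts routing: only top_k of n_experts run per token,
# so the "activated" parameter count is a small fraction of the total.
rng = np.random.default_rng(0)

d_model = 16       # hidden size (toy value)
n_experts = 8      # total experts; their weights dominate the parameter count
top_k = 2          # experts actually evaluated for each token

experts = [rng.standard_normal((d_model, d_model)) for _ in range(n_experts)]
router = rng.standard_normal((d_model, n_experts))  # routing weights

def moe_forward(x):
    """Route one token vector to its top-k experts and mix their outputs."""
    logits = x @ router                    # score every expert
    top = np.argsort(logits)[-top_k:]      # indices of the k best-scoring experts
    weights = np.exp(logits[top])
    weights /= weights.sum()               # softmax over the chosen experts only
    # Only the selected experts' weights are touched, so per-token compute
    # scales with top_k, not with n_experts.
    return sum(w * (x @ experts[i]) for w, i in zip(weights, top))

token = rng.standard_normal(d_model)
print(moe_forward(token).shape)  # (16,)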


046c9a9d819e · 244GB

deepseek2 · 671B parameters · Q2_K quantization
System prompt: You are a friendly assistant.
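If this listing is served through an Ollama-style registry (the digest/quantization layout suggests a GGUF model), a chat call with the official Ollama Python client might look like the sketch below. The tag "deepseek-671b:q2_k" is a placeholder, not necessarily the real tag, and a local Ollama server with the weights already pulled is assumed.

import ollama

# Minimal sketch, assuming the ollama Python package and a running Ollama
# server that already has this quantized model pulled.
response = ollama.chat(
    model="deepseek-671b:q2_k",  # placeholder tag; use the tag shown on the page
    messages=[
        # The listing's default system prompt.
        {"role": "system", "content": "You are a friendly assistant."},
        {"role": "user", "content": "Summarize what a Mixture-of-Experts model is."},
    ],
)
print(response["message"]["content"])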
