2,029 7 months ago

Unsloth's DeepSeek-R1 , I just merged the thing and uploaded it here. This is the full 671b model. MoE Bits:2.51bit Type:UD-Q2_K_XL Disk Size:212GB Accuracy:Best Details:MoE all 2.5bit. down_proj in MoE mixture of 3.5/2.5bit

7 months ago

9dee85e0fe5f · 227GB

deepseek2
·
671B
·
Q2_K
{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice
{ "num_gpu": 11, "stop": [ "<|begin▁of▁sentence|>", "<|end▁of▁

Readme

tag latest :The graphics card is not supported for both ollama and screen display at the same time

tag 24g for 24g vRAM: 21g model + 1.8g fp16 kv cache + windows system reserve