170.9K 8 months ago

Unsloth's DeepSeek-R1. I just merged it and uploaded it here. This is the full 671B model.

MoE bits: 1.58-bit
Type: UD-IQ1_S
Disk size: 131GB
Accuracy: Fair
Details: MoE all 1.56-bit; down_proj in MoE is a mixture of 2.06/1.56-bit
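As a rough sanity check on the numbers above, the average bits per weight can be estimated from the disk size and the parameter count. This is only a sketch: it assumes "131GB" means decimal gigabytes and that essentially all of the file is weights, and the function name `avg_bits_per_weight` is made up for illustration.

```python
# Rough estimate of average bits per weight from file size and parameter count.
# Assumptions (not stated in the source): 1 GB = 1e9 bytes, and the whole
# file is weight data (metadata and tokenizer overhead are ignored).

def avg_bits_per_weight(disk_bytes: float, n_params: float) -> float:
    """Return the average number of bits stored per model parameter."""
    return disk_bytes * 8 / n_params

DISK_GB = 131    # disk size quoted in the description
PARAMS_B = 671   # parameter count in billions

bpw = avg_bits_per_weight(DISK_GB * 1e9, PARAMS_B * 1e9)
print(f"{bpw:.2f} bits per weight")  # prints "1.56 bits per weight"
```

Under these assumptions the result lands close to the 1.56-bit figure quoted for the MoE layers, which is what you would expect if those layers dominate the file.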

bd8b066a31fb · 140GB · deepseek2 · 671B · IQ1_S
{ "stop": [ "<|begin▁of▁sentence|>", "<|end▁of▁sentence|>",
{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice

Readme

tag latest: The graphics card cannot be used for both Ollama and screen display at the same time.

tag 24g: for 24 GB of VRAM: 16 GB model + 1.8 GB fp16 KV cache + Windows system reserve
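The budget for the 24g tag can be worked through as simple arithmetic. The sketch below assumes the three items are meant to add up to the full 24 GB, so the Windows reserve is whatever remains; the helper name `remaining_vram` is hypothetical.

```python
# VRAM budget for the 24g tag, using the figures from the line above.
# Assumption: model + KV cache + Windows reserve together fill the 24 GB card,
# so the reserve is computed as the remainder.

TOTAL_VRAM_GB = 24.0
MODEL_GB = 16.0      # portion of the model held in VRAM
KV_CACHE_GB = 1.8    # fp16 KV cache

def remaining_vram(total_gb: float, *used_gb: float) -> float:
    """Return how much VRAM is left after subtracting each listed allocation."""
    return total_gb - sum(used_gb)

reserve_gb = remaining_vram(TOTAL_VRAM_GB, MODEL_GB, KV_CACHE_GB)
print(f"{reserve_gb:.1f} GB left for the Windows system reserve")
```

If the card also drives a display, or the context length grows the KV cache, the remainder shrinks accordingly.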