Huzderu/deepseek-r1-671b-1.73bit

Huzderu/ deepseek-r1-671b-1.73bit:latest

26.8K Downloads Updated 1 year ago

Merged GGUF Unsloth's DeepSeek-R1 671B 1.73bit dynamic quant

ollama run Huzderu/deepseek-r1-671b-1.73bit

curl http://localhost:11434/api/chat \
  -d '{
    "model": "Huzderu/deepseek-r1-671b-1.73bit",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='Huzderu/deepseek-r1-671b-1.73bit',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'Huzderu/deepseek-r1-671b-1.73bit',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

08d46664e5ce · 169GB ·

model

archdeepseek2

parameters671B

quantizationIQ1_S

169GB

license

1.1kB

params

{ "stop": [ "<｜begin▁of▁sentence｜>", "<｜end▁of▁sentence｜>",

148B

template

{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice

394B

Readme

Unsloth’s DeepSeek-R1 671B 1.73-bit dynamic quantization, merged GGUF files for Ollama.

Original GGUF on HuggingFace here.

LICENSE

This code repository and the model weights are licensed under the MIT License. DeepSeek-R1 series support commercial use, allow for any modifications and derivative works, including, but not limited to, distillation for training other LLMs. Please note that:

DeepSeek-R1-Distill-Qwen-1.5B, DeepSeek-R1-Distill-Qwen-7B, DeepSeek-R1-Distill-Qwen-14B and DeepSeek-R1-Distill-Qwen-32B are derived from Qwen-2.5 series, which are originally licensed under Apache 2.0 License, and now finetuned with 800k samples curated with DeepSeek-R1.
DeepSeek-R1-Distill-Llama-8B is derived from Llama3.1-8B-Base and is originally licensed under llama3.1 license.
DeepSeek-R1-Distill-Llama-70B is derived from Llama3.3-70B-Instruct and is originally licensed under llama3.3 license.