huihui_ai/deepseek-r1:671b-q3_K

314 Downloads · Updated 1 year ago

DeepSeek's first generation of reasoning models, with performance comparable to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen.

thinking 671b

ollama run huihui_ai/deepseek-r1:671b-q3_K

curl http://localhost:11434/api/chat \
  -d '{
    "model": "huihui_ai/deepseek-r1:671b-q3_K",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='huihui_ai/deepseek-r1:671b-q3_K',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'huihui_ai/deepseek-r1:671b-q3_K',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

bd9309ab05f9 · 319GB

model · 319GB
arch: deepseek2
parameters: 671B
quantization: Q3_K_M

license · 1.1kB

params · 148B
{ "stop": [ "<｜begin▁of▁sentence｜>", "<｜end▁of▁sentence｜>",

template · 387B
{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slice

Readme

Note: this model requires Ollama 0.9 or later.
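One way to check the version requirement before pulling a 319GB model is to ask the local server for its version. The sketch below is only an illustration: it assumes the server listens on the default `http://localhost:11434` and exposes the `/api/version` endpoint, and the helper names (`parse_version`, `meets_minimum`, `server_version`) are hypothetical, not part of any Ollama library.

```python
import json
import re
import urllib.request

MIN_VERSION = (0, 9)  # from the note above: Ollama 0.9 or later


def parse_version(text):
    """Return the leading numeric components of a version string as a tuple."""
    match = re.match(r"v?(\d+(?:\.\d+)*)", text.strip())
    if not match:
        raise ValueError(f"unrecognized version string: {text!r}")
    return tuple(int(part) for part in match.group(1).split("."))


def _pad(version, width):
    """Right-pad a version tuple with zeros so tuples compare component-wise."""
    return version + (0,) * (width - len(version))


def meets_minimum(text, minimum=MIN_VERSION):
    """True if the version string is at least the minimum version."""
    version = parse_version(text)
    width = max(len(version), len(minimum))
    return _pad(version, width) >= _pad(minimum, width)


def server_version(host="http://localhost:11434"):
    """Ask a running Ollama server for its version (assumed default host)."""
    with urllib.request.urlopen(f"{host}/api/version") as resp:
        return json.load(resp)["version"]
```

With a server running, `meets_minimum(server_version())` should return `True` before you attempt `ollama run`.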

Before sending a prompt, set num_thread to half of your CPU's thread count; otherwise the model may slow down your computer. Here is an example for a machine with 64 threads:

/set parameter num_thread 32
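When calling the model from Python rather than the interactive prompt, the same setting can probably be passed per request through the `options` argument. This is a minimal sketch, assuming the `ollama` Python package is installed and that `os.cpu_count()` reflects the thread count you want to halve; `half_thread_count` and `chat_with_half_threads` are hypothetical helper names.

```python
import os


def half_thread_count(total=None):
    """Return half of the logical CPU count, but never less than 1."""
    if total is None:
        # os.cpu_count() can return None; fall back to 2 so the result is 1.
        total = os.cpu_count() or 2
    return max(1, total // 2)


def chat_with_half_threads(prompt, model="huihui_ai/deepseek-r1:671b-q3_K"):
    """Send one prompt with num_thread set to half the CPU thread count."""
    # Imported lazily so the helper above works without the ollama package.
    from ollama import chat

    response = chat(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        options={"num_thread": half_thread_count()},
    )
    return response.message.content
```

On a 64-thread machine `half_thread_count()` yields 32, matching the `/set parameter num_thread 32` example above.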