DeepSeek's first generation of reasoning models, with performance comparable to OpenAI-o1, including six dense models distilled from DeepSeek-R1 based on Llama and Qwen.
671b
44 Pulls Updated 3 weeks ago
bd9309ab05f9 · 319GB
model
arch deepseek2 · parameters 671B · quantization Q3_K_M
319GB
params
{
"stop": [
"<|begin▁of▁sentence|>",
"<|end▁of▁sentence|>",
148B
template
{{- if .System }}{{ .System }}{{ end }}
{{- range $i, $_ := .Messages }}
{{- $last := eq (len (slice
387B
license
MIT License
Copyright (c) 2023 DeepSeek
Permission is hereby granted, free of charge, to any perso
1.1kB
Readme
Note: this model requires Ollama 0.5.5 or later.
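
A quick way to check the version requirement above is to compare the installed version against 0.5.5. The following is a minimal sketch, not part of Ollama itself: the helper names `parse_version` and `meets_minimum` are hypothetical, and it assumes the version is reported as a dotted `X.Y.Z` number somewhere in the text you pass in (for example, the output of `ollama --version`).

```python
import re

# Minimum Ollama version stated in the note above.
MIN_VERSION = (0, 5, 5)


def parse_version(text):
    """Return the first X.Y.Z number found in text as a tuple of ints.

    Assumption: the version appears as three dot-separated integers.
    """
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        raise ValueError(f"no version number found in {text!r}")
    return tuple(int(part) for part in match.groups())


def meets_minimum(text, minimum=MIN_VERSION):
    """True if the version found in text is at least the minimum."""
    # Tuples compare element by element, so (0, 10, 0) > (0, 5, 5).
    return parse_version(text) >= minimum
```

Comparing integer tuples rather than raw strings avoids the pitfall where "0.10.0" sorts before "0.5.5" alphabetically. To use it, pass in whatever version string your installation reports.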