deepseek-v2.5:236b-q4

deepseek-v2.5:236b-q4_1

276.7K Downloads Updated 1 year ago

An upgraded version of DeepSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.

236b

ollama run deepseek-v2.5:236b-q4_1

curl http://localhost:11434/api/chat \
  -d '{
    "model": "deepseek-v2.5:236b-q4_1",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='deepseek-v2.5:236b-q4_1',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'deepseek-v2.5:236b-q4_1',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

b1618a20b8b5 · 148GB ·

model

arch: deepseek2

parameters: 236B

quantization: Q4_1

148GB
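
As a rough sanity check on the 148GB figure, the sketch below estimates the file size from the parameter count and the quantization type. It assumes the GGML/GGUF Q4_1 block layout (32 weights per block, stored as a 2-byte scale, a 2-byte minimum, and 16 bytes of packed 4-bit values), that the nominal 236B is the exact parameter count, and that every tensor is stored as Q4_1. In practice some tensors are often kept at a different precision, so treat the result as an approximation only. The function name is illustrative.

```python
# Assumed Q4_1 block layout (GGML/GGUF): 32 weights share one block made of
# a 2-byte scale, a 2-byte minimum, and 16 bytes of packed 4-bit values.
WEIGHTS_PER_BLOCK = 32
BYTES_PER_BLOCK = 2 + 2 + 16  # 20 bytes per block, i.e. 5 bits per weight


def estimate_q4_1_size_gb(n_params: float) -> float:
    """Approximate on-disk size in decimal gigabytes, assuming all weights are Q4_1."""
    total_bytes = n_params / WEIGHTS_PER_BLOCK * BYTES_PER_BLOCK
    return total_bytes / 1e9


# 236B is the nominal parameter count listed on this page.
estimate = estimate_q4_1_size_gb(236e9)
print(f"{estimate:.1f} GB")
```

Under these assumptions the estimate lands within about one gigabyte of the listed size, which suggests the download is dominated by the 4-bit weights.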

template

{{- if .Suffix }}<｜fim▁begin｜>{{ .Prompt }}<｜fim▁hole｜>{{ .Suffix }}<｜fim▁end｜> {{

493B
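
The visible part of the template suggests that when a suffix is supplied, the prompt is wrapped in fill-in-the-middle (FIM) markers. The sketch below mirrors only that visible fragment in plain Python to show what the resulting prompt string would look like. The helper name `build_fim_prompt` is hypothetical, and because the template shown above is truncated, anything the full template adds after `<｜fim▁end｜>` is not reproduced here.

```python
# Special tokens copied from the visible portion of the template above.
FIM_BEGIN = "<｜fim▁begin｜>"
FIM_HOLE = "<｜fim▁hole｜>"
FIM_END = "<｜fim▁end｜>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Wrap the code before and after the gap in FIM markers (hypothetical helper)."""
    return f"{FIM_BEGIN}{prefix}{FIM_HOLE}{suffix}{FIM_END}"


# The model is expected to generate the text that belongs at the hole.
prompt = build_fim_prompt(
    prefix="def add(a, b):\n    return ",
    suffix="\n\nprint(add(1, 2))",
)
print(prompt)
```

When going through Ollama there should be no need to build this string by hand: passing the trailing code as the `suffix` field of a generate request lets the server apply the template itself.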

license

14kB

params

{ "stop": [ "<｜begin▁of▁sentence｜>", "<｜end▁of▁sentence｜>",

241B

Readme

DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions.

DeepSeek-V2.5 better aligns with human preferences and has been optimized in various aspects, including writing and instruction following:

| Metric                 | DeepSeek-V2-0628 | DeepSeek-Coder-V2-0724 | DeepSeek-V2.5 |
|:-----------------------|:-----------------|:-----------------------|:--------------|
| AlpacaEval 2.0          | 46.6             | 44.5                   | 50.5          |
| ArenaHard              | 68.3             | 66.3                   | 76.2          |
| AlignBench             | 7.88             | 7.91                   | 8.04          |
| MT-Bench               | 8.85             | 8.91                   | 9.02          |
| HumanEval python       | 84.5             | 87.2                   | 89            |
| HumanEval Multi        | 73.8             | 74.8                   | 73.8          |
| LiveCodeBench(01-09)   | 36.6             | 39.7                   | 41.8          |
| Aider                  | 69.9             | 72.9                   | 72.2          |
| SWE-verified           | N/A              | 19                     | 16.8          |
| DS-FIM-Eval            | N/A              | 73.2                   | 78.3          |
| DS-Arena-Code          | N/A              | 49.5                   | 63.1          |
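
To read the table more precisely, the following sketch lists the metrics on which DeepSeek-V2.5 scores below the better of its two predecessors. The scores are copied from the table above; the variable and function names are illustrative, and `None` stands in for the N/A entries.

```python
# Scores from the table: (DeepSeek-V2-0628, DeepSeek-Coder-V2-0724, DeepSeek-V2.5).
SCORES = {
    "AlpacaEval 2.0": (46.6, 44.5, 50.5),
    "ArenaHard": (68.3, 66.3, 76.2),
    "AlignBench": (7.88, 7.91, 8.04),
    "MT-Bench": (8.85, 8.91, 9.02),
    "HumanEval python": (84.5, 87.2, 89),
    "HumanEval Multi": (73.8, 74.8, 73.8),
    "LiveCodeBench(01-09)": (36.6, 39.7, 41.8),
    "Aider": (69.9, 72.9, 72.2),
    "SWE-verified": (None, 19, 16.8),
    "DS-FIM-Eval": (None, 73.2, 78.3),
    "DS-Arena-Code": (None, 49.5, 63.1),
}


def metrics_below_best_predecessor(scores):
    """Return metrics where V2.5 is lower than the best available predecessor score."""
    result = []
    for metric, (v2, coder, v25) in scores.items():
        predecessors = [s for s in (v2, coder) if s is not None]
        if v25 < max(predecessors):
            result.append(metric)
    return result


below = metrics_below_best_predecessor(SCORES)
print(below)
```

On these numbers, V2.5 leads on eight of the eleven metrics and trails DeepSeek-Coder-V2-0724 on HumanEval Multi, Aider, and SWE-verified.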

## Reference

[Hugging Face](https://huggingface.co/deepseek-ai/DeepSeek-V2.5)