An upgraded version of DeekSeek-V2 that integrates the general and coding abilities of both DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct.
236b
8,832 Pulls Updated 8 weeks ago
Updated 8 weeks ago
8 weeks ago
409b2dd8a3c4 · 133GB
model
archdeepseek2
·
parameters236B
·
quantizationQ4_0
133GB
params
{"stop":["\u003c|begin▁of▁sentence|\u003e","\u003c|end▁of▁sentence|\u003e","\u003c
241B
template
{{- if .Suffix }}<|fim▁begin|>{{ .Prompt }}<|fim▁hole|>{{ .Suffix }}<|fim▁end|>
{{
493B
license
DEEPSEEK LICENSE AGREEMENT
Version 1.0, 23 October 2023
Copyright (c) 2023 DeepSeek
Section I: PR
14kB
Readme
DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct. The new model integrates the general and coding abilities of the two previous versions.
DeepSeek-V2.5 better aligns with human preferences and has been optimized in various aspects, including writing and instruction following:
Metric | DeepSeek-V2-0628 | DeepSeek-Coder-V2-0724 | DeepSeek-V2.5 |
---|---|---|---|
AlpacaEval 2.0 | 46.6 | 44.5 | 50.5 |
ArenaHard | 68.3 | 66.3 | 76.2 |
AlignBench | 7.88 | 7.91 | 8.04 |
MT-Bench | 8.85 | 8.91 | 9.02 |
HumanEval python | 84.5 | 87.2 | 89 |
HumanEval Multi | 73.8 | 74.8 | 73.8 |
LiveCodeBench(01-09) | 36.6 | 39.7 | 41.8 |
Aider | 69.9 | 72.9 | 72.2 |
SWE-verified | N/A | 19 | 16.8 |
DS-FIM-Eval | N/A | 73.2 | 78.3 |
DS-Arena-Code | N/A | 49.5 | 63.1 |