
This is a brand new Mixture of Experts (MoE) model from DeepSeek, specialized for coding instructions (quantized to IQ4_XS).


Based on https://huggingface.co/lmstudio-community/DeepSeek-Coder-V2-Lite-Instruct-GGUF

| Feature  | Value |
|----------|-------|
| vision   | false |
| thinking | false |
| tools    | true  |
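
Since `tools` is reported as true, below is a minimal sketch of a tool-calling request against a locally running Ollama server. The model tag `deepseek-coder-v2-lite:16b` and the `run_tests` tool definition are placeholders for illustration only; substitute the tag you actually pulled.

```python
import requests

# Placeholder tag -- substitute whatever tag you pulled for this model.
MODEL = "deepseek-coder-v2-lite:16b"

# One example tool in the OpenAI-style "function" format Ollama accepts.
tools = [{
    "type": "function",
    "function": {
        "name": "run_tests",
        "description": "Run the project's test suite and return a summary.",
        "parameters": {
            "type": "object",
            "properties": {
                "path": {"type": "string", "description": "Directory to test"},
            },
            "required": ["path"],
        },
    },
}]

resp = requests.post(
    "http://localhost:11434/api/chat",
    json={
        "model": MODEL,
        "messages": [
            {"role": "user", "content": "Run the tests in ./src and report any failures."},
        ],
        "tools": tools,
        "stream": False,
    },
    timeout=300,
)
resp.raise_for_status()
message = resp.json()["message"]

# If the model chose to call the tool, the call shows up in "tool_calls";
# otherwise print its plain text answer.
print(message.get("tool_calls") or message.get("content"))
```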
| Device            | Speed, token/s | Context | VRAM, GB             | Versions       |
|-------------------|----------------|---------|----------------------|----------------|
| RTX 3090 24 GB    | ~139           | 4096    | 10                   | IQ4_XS, 0.12.2 |
| RTX 3090 24 GB    | ~139           | 15360   | 13                   | IQ4_XS, 0.12.2 |
| RTX 2080 Ti 11 GB | ~114           | 4096    | 10                   | IQ4_XS, 0.12.2 |
| RTX 2080 Ti 11 GB | ~37            | 15360   | 14 (23%/77% CPU/GPU) | IQ4_XS, 0.12.2 |
| M1 Max 32 GB      | ~80            | 4096    | 10                   | IQ4_XS, 0.12.2 |
| M1 Max 32 GB      | ~81            | 15360   | 14                   | IQ4_XS, 0.12.2 |
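
The Context column is the context window the server was run with. One way to reproduce a row is to set `num_ctx` per request and compute token/s from Ollama's `eval_count` and `eval_duration` fields; this is just a sketch and may not match how the numbers above were collected. The model tag is again a placeholder.

```python
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "deepseek-coder-v2-lite:16b",   # placeholder tag
        "prompt": "Write a Python function that parses ISO 8601 timestamps.",
        "options": {"num_ctx": 15360},           # context window, as in the second row per device
        "stream": False,
    },
    timeout=300,
)
resp.raise_for_status()
body = resp.json()

# eval_duration is reported in nanoseconds; eval_count is the number of generated tokens.
print(body["eval_count"] / (body["eval_duration"] / 1e9), "token/s")
```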