957 5 months ago

deepseek-v3-0324 quants. Q2_K is the lowest quantization level offered here. Quantization maps each weight as: quantized = round((original - zero_point) / scale)
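As a rough illustration of the formula above, here is a minimal Python sketch of affine quantization to a small number of bits. The helper names (`fit_params`, `quantize`, `dequantize`) and the choice of taking `zero_point` as the minimum value are assumptions made for this example. The actual Q2_K format is block-wise and more involved, so this is not that format, only the basic idea.

```python
def fit_params(values, bits):
    """Pick scale and zero_point so the value range maps onto 0 .. 2**bits - 1.

    Assumption for this sketch: zero_point is the minimum value,
    expressed in the original (unquantized) units.
    """
    lo, hi = min(values), max(values)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    return scale, lo


def quantize(values, scale, zero_point):
    """quantized = round((original - zero_point) / scale)"""
    return [round((v - zero_point) / scale) for v in values]


def dequantize(codes, scale, zero_point):
    """Approximate inverse: original ~= code * scale + zero_point"""
    return [c * scale + zero_point for c in codes]


if __name__ == "__main__":
    weights = [0.0, 0.2, 1.1, 1.9, 3.0]
    scale, zero_point = fit_params(weights, bits=2)
    codes = quantize(weights, scale, zero_point)
    print(codes)
    print(dequantize(codes, scale, zero_point))
```

With 2 bits only four distinct codes are available, so many different weights collapse onto the same value after dequantization. That loss of precision is the trade-off for the smaller file size at the lowest quantization level.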

9644c0387af6 · 19B
{
"num_ctx": 131072
}