89 2 years ago

q4_k_m quantization only. Now using 8k context size