100 1 month ago

Yi-Coder 9B Chat quantized to Q4_K_M using llama.cpp, reducing size from ~18 GB to ~5.3 GB. Runs on 8 GB RAM. Optimized for Python test generation with pytest. Produces clean, ready-to-run code. Apache 2.0 licensed.

e3b2721cec51 · 102B
{{- range .Messages }}<|im_start|>{{ .Role }}
{{ .Content }}<|im_end|>
{{ end }}<|im_start|>assistant