The ollama model for the 8bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit).

8B

10.8K Pulls Updated 8 weeks ago

fbbe9f68bba1 · 256B
{{ if .System }}<|start_header_id|>system<|end_header_id|> {{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|> {{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|> {{ .Response }}<|eot_id|>