The ollama model for the 8bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit).

8B

10.8K Pulls Updated 8 weeks ago

50020e23ef83 · 126B
{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>" ], "temperature": 0.6, "top_p": 0.9 }