The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).
6,057 Pulls Updated 7 months ago
50020e23ef83 · 126B
{
"stop": [
"<|start_header_id|>",
"<|end_header_id|>",
"<|eot_id|>"
],
"temperature": 0.6,
"top_p": 0.9
}