The ollama model for the 8bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-8bit).

13.6K 7 months ago

4fc7dad80c9b · 30B
You are a helpful assistant.