The model used is a quantized version of `Llama-3-Taiwan-8B-Instruct`. More details can be found on its Hugging Face page: https://huggingface.co/yentinglin/Llama-3-Taiwan-8B-Instruct


The model used is a quantized version of `Llama-3-Taiwan-8B-Instruct`, an 8-billion-parameter model specialized for Traditional Chinese conversation. Quantization reduces the model's size and computational requirements while largely maintaining performance, making it suitable for deployment in resource-constrained environments. More details can be found on the Hugging Face page.
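To illustrate the general idea behind quantized releases like this one, here is a minimal sketch (not the actual quantization scheme used for this model): float32 weights are mapped to int8 with a per-tensor scale, cutting storage 4x at the cost of a small, bounded reconstruction error.

```python
import numpy as np

# Stand-in for one weight tensor of a model.
rng = np.random.default_rng(0)
weights = rng.standard_normal(1024).astype(np.float32)

# Symmetric int8 quantization: one scale per tensor.
scale = np.abs(weights).max() / 127.0
q = np.round(weights / scale).astype(np.int8)   # stored form, 1 byte/weight
dequant = q.astype(np.float32) * scale          # approximate reconstruction

print(weights.nbytes // q.nbytes)               # 4x smaller on disk / in memory
print(np.max(np.abs(weights - dequant)) < scale)  # rounding error under one step
```

Real deployments typically use finer-grained schemes (per-channel or block-wise scales, 4-bit formats), but the size/accuracy trade-off works the same way.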