The model used is a quantized version of `Llama-3-Taiwan-8B-Instruct-DPO`. More details can be found on the website (https://huggingface.co/yentinglin/Llama-3-Taiwan-8B-Instruct-DPO)
50 Pulls Updated 4 months ago
Updated 4 months ago
4 months ago
e495357aca5c · 5.6GB
model
archllama
·
parameters8.03B
·
quantizationQ5_0
5.6GB
params
{"num_ctx":8192,"stop":["\u003c|start_header_id|\u003e","\u003c|end_header_id|\u003e","\u003c|end_of
171B
template
"{{ if .System }}<|start_header_id|>system<|end_header_id|>
{{ .System }}<|eot_id|>{{ end }}{{ if .
257B
Readme
No readme