mannix/llama3-sppo-iter3:q6

mannix/ llama3-sppo-iter3:q6_k

265 Downloads Updated 2 years ago

Meta Llama-3-8b with Self-Play Preference Optimization for Language Model Alignment at iteration 3

template

8ab4849b038c · 254B

{{ if .System }}<|start_header_id|>system<|end_header_id|>

{{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|>

{{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|>

{{ .Response }}<|eot_id|>