Trained with RLHF on top of Llama-3-70B-Instruct; achieves high scores on Arena-Hard-Auto.

0046e5dbf817 · 255B
{{ if .System }}<|start_header_id|>system<|end_header_id|>
{{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|>
{{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|>
{{ .Response }}<|eot_id|>
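
To make the template's behavior concrete, here is a minimal Python sketch of how it expands. The function name `render_prompt` and its parameters are hypothetical and not part of any library. The line breaks follow the template exactly as displayed above, and the sketch assumes an empty string counts as "not set", as in Go templates.

```python
def render_prompt(prompt: str, system: str = "", response: str = "") -> str:
    """Expand the chat template above using plain string building.

    Hypothetical helper for illustration only; it mirrors the template's
    {{ if }} sections but is not how any runtime actually implements them.
    """
    parts = []
    if system:  # {{ if .System }} ... {{ end }}
        parts.append(
            "<|start_header_id|>system<|end_header_id|>\n"
            f"{system}<|eot_id|>"
        )
    if prompt:  # {{ if .Prompt }} ... {{ end }}
        parts.append(
            "<|start_header_id|>user<|end_header_id|>\n"
            f"{prompt}<|eot_id|>"
        )
    # The assistant header is always emitted, followed by the response.
    parts.append(
        "<|start_header_id|>assistant<|end_header_id|>\n"
        f"{response}<|eot_id|>"
    )
    # The template has no separator between sections, so join with "".
    return "".join(parts)


if __name__ == "__main__":
    print(render_prompt("What is RLHF?", system="You are a helpful assistant."))
```

When `system` is empty, the system block is skipped entirely, so the rendered text starts directly with the user header.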