Experimental model doing a DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.
230 Pulls Updated 11 months ago
8ab4849b038c · 254B
{{ if .System }}<|start_header_id|>system<|end_header_id|>
{{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|>
{{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|>
{{ .Response }}<|eot_id|>