mannix/gemma2-9b-sppo-iter3:q5_k_m/template

mannix/gemma2-9b-sppo-iter3:q5_k_m

1,100 Downloads · Updated 1 year ago

This model was developed using Self-Play Preference Optimization (SPPO) at iteration 3, with google/gemma-2-9b-it as the starting point.


template

c827688e53c8 · 137B

<start_of_turn>user
{{ if .System }}{{ .System }} {{ end }}{{ .Prompt }}<end_of_turn>
<start_of_turn>model
{{ .Response }}<end_of_turn>
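
To make the template's effect concrete, here is a minimal Python sketch that mimics what it produces. It is a hand-written stand-in, not Go's template engine and not Ollama's own rendering code. The function name `render_prompt` and its parameters are hypothetical, and the sketch assumes the template's lines are separated by single newlines. It shows the one conditional in the template: when a system message is set, it is prepended to the user turn followed by a single space, since the template has no separate system turn.

```python
def render_prompt(prompt: str, system: str = "", response: str = "") -> str:
    """Build the text the template above would produce (hypothetical helper).

    Assumption: template lines are joined by single newlines.
    """
    # {{ if .System }}{{ .System }} {{ end }}: system text plus one space, or nothing
    system_part = f"{system} " if system else ""
    return (
        "<start_of_turn>user\n"
        f"{system_part}{prompt}<end_of_turn>\n"
        "<start_of_turn>model\n"
        f"{response}<end_of_turn>\n"
    )


if __name__ == "__main__":
    # Example values are illustrative only.
    print(render_prompt("Why is the sky blue?", system="Answer briefly."))
```

With an empty `system` argument the user turn starts directly with the prompt, which matches the `{{ if .System }}` guard.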