Models
GitHub
Discord
Turbo
Sign in
Download
Models
Download
GitHub
Discord
Sign in
mannix
/
gemma2-9b-sppo-iter3
:q5_1
998
Downloads
Updated
1 year ago
This model was developed using Self-Play Preference Optimization at iteration 3, based on the google/gemma-2-9b-it architecture as starting point.
This model was developed using Self-Play Preference Optimization at iteration 3, based on the google/gemma-2-9b-it architecture as starting point.
Cancel
gemma2-9b-sppo-iter3:q5_1
...
/
params
1f0c17ce1cdb · 118B
{
"num_ctx": 4096,
"num_predict": 4096,
"repeat_penalty": 1,
"stop": [
"<start_of_turn>",
"<end_of_turn>"
]
}