Models
GitHub
Discord
Turbo
Sign in
Download
Models
Download
GitHub
Discord
Sign in
mannix
/
llama3-sppo-iter3
:q8_0
235
Downloads
Updated
1 year ago
Meta Llama-3-8b with Self-Play Preference Optimization for Language Model Alignment at iteration 3
Meta Llama-3-8b with Self-Play Preference Optimization for Language Model Alignment at iteration 3
Cancel
llama3-sppo-iter3:q8_0
...
/
params
577073ffcc6c · 110B
{
"num_keep": 24,
"stop": [
"<|start_header_id|>",
"<|end_header_id|>",
"<|eot_id|>"
]
}