526 1 year ago

Mistral-Nemo-Instruct-2407 korean finetuned model with SFT->DPO

tools 12b
83a43c6a3a0f · 44B
{
"num_keep": 10,
"stop": [
"[INST]",
"[/INST]"
]
}