Korean fine-tuned model based on Mistral-Nemo-Instruct-2407, trained with SFT followed by DPO

tools 12b

491 8 months ago

83a43c6a3a0f · 44B
{
  "num_keep": 10,
  "stop": [
    "[INST]",
    "[/INST]"
  ]
}
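
The parameters above can be read as per-request generation options: `num_keep` is the number of leading prompt tokens retained when the context is trimmed, and `stop` lists the strings that end generation. The following is a minimal sketch, assuming the model is served through an Ollama-style `/api/generate` request body with an `options` field; the model name `my-korean-nemo` is a hypothetical placeholder, and `truncate_at_stop` is only an illustration of what a stop list does, not the server's actual implementation.

```python
import json

# Parameters copied from the listing above.
PARAMS = json.loads("""
{
  "num_keep": 10,
  "stop": ["[INST]", "[/INST]"]
}
""")


def build_payload(model: str, prompt: str) -> dict:
    """Build a request body that passes the parameters as per-request options.

    Assumption: the server accepts an Ollama-style body with an "options" object.
    """
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": dict(PARAMS),  # copy so callers cannot mutate PARAMS
    }


def truncate_at_stop(text: str, stops: list) -> str:
    """Cut text at the earliest stop sequence (illustration only)."""
    cut = len(text)
    for stop in stops:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)
    return text[:cut]


if __name__ == "__main__":
    # "my-korean-nemo" is a hypothetical model name, not taken from the listing.
    payload = build_payload("my-korean-nemo", "안녕하세요")
    print(json.dumps(payload, ensure_ascii=False, indent=2))
```

Listing `[INST]` and `[/INST]` as stop sequences keeps the model from emitting the Mistral instruction delimiters and continuing into a new, self-generated turn.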