NousResearch/Meta-Llama-3.1-8B-Instruct Korean finetuned model with SFT->RLHF->DPO

Tools 8B 70B

166 Pulls Updated 5 weeks ago

2 Tags
f0b142109ced • 8.5GB • Updated 5 weeks ago
10f7cc80af2f • 50GB • Updated 5 weeks ago