NousResearch/Meta-Llama-3.1-8B-Instruct Korean finetuned model with SFT->RLHF->DPO

tools 8b 70b

685 3 months ago

2 Tags
f0b142109ced • 8.5GB • 3 months ago
10f7cc80af2f • 50GB • 3 months ago