latest
4.7GB
SFR-Iterative-DPO-LLaMA-3-8B-R is a LLaMA-3-8B model further fine-tuned with SFT and RLHF, which provides good performance. The model is from the Salesforce team.
8B
135 Pulls · Updated 4 months ago
3dd6d66d7bdc · 34B
You're a very useful AI assistant.