Tags · wangrongsheng/sfr-iterative-dpo-llama-3-8b-r

wangrongsheng/ sfr-iterative-dpo-llama-3-8b-r

278 Downloads Updated 2 years ago

SFR-Iterative-DPO-LLaMA-3-8B-R is a further (SFT and RLHF) fine-tuned model on LLaMA-3-8B, which provides good performance. The model is from Salesforce team.

Name

1 model

Size / Usage

Context

Input

sfr-iterative-dpo-llama-3-8b-r:latest

df04f1eec67a • 4.7GB • 8K context window • Text input • 2 years ago

Text input • 2 years ago

sfr-iterative-dpo-llama-3-8b-r:latest

4.7GB

8K

Text

df04f1eec67a · 2 years ago