wangrongsheng/sfr-iterative-dpo-llama-3-8b-r
215 Downloads • Updated 1 year ago
SFR-Iterative-DPO-LLaMA-3-8B-R is a LLaMA-3-8B model further fine-tuned with SFT and RLHF, and it is described as providing good performance. The model is from the Salesforce team.
1 model

Name: sfr-iterative-dpo-llama-3-8b-r:latest (df04f1eec67a, 1 year ago)
Size: 4.7GB
Context: 8K
Input: Text
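
A minimal sketch of how the tag listed above might be queried, under two assumptions that the listing itself does not confirm: that the model is served by a local Ollama server, and that the server listens on its default address (http://localhost:11434). The helper names build_request and generate are hypothetical and only illustrate the shape of the call.

```python
import json
import urllib.request

# Model tag taken from the listing above.
MODEL = "wangrongsheng/sfr-iterative-dpo-llama-3-8b-r:latest"

# Assumption: a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model=MODEL, url=OLLAMA_URL):
    """Build (but do not send) a non-streaming generate request."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt):
    """Send the request and return the generated text.

    Requires a running server that has already pulled the model.
    """
    with urllib.request.urlopen(build_request(prompt), timeout=120) as resp:
        return json.loads(resp.read())["response"]
```

With the server running and the model pulled, generate("Explain DPO in one sentence.") would return the reply as a string. The listing gives an 8K context window and text-only input, so the prompt and the reply together need to fit within that limit.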