A 3B-parameter GPT-like model fine-tuned with Direct Preference Optimization (DPO) on a mix of publicly available datasets.
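For readers unfamiliar with DPO, the following is a minimal sketch of the standard per-example DPO loss, not this model's actual training code. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions; the listing does not state the hyperparameters or datasets used.

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss from summed sequence log-probabilities.

    All names and the beta default are hypothetical, chosen for illustration.
    """
    # How much more (in log space) the policy likes each response
    # than the frozen reference model does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # Loss is -log(sigmoid(logits)), written in a numerically stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))
```

When the policy and reference agree exactly, the loss equals log 2; it falls below that as the policy shifts probability toward the preferred response relative to the reference, and rises above it in the opposite case.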
153 Pulls • Updated 7 months ago
1 Tag
d8e572d2a0b9 • 2.0GB • Updated 7 months ago