A 3B parameter GPT-like model fine-tuned on a mix of publicly available datasets using DPO.

153 Pulls Updated 7 months ago

1 Tag
d8e572d2a0b9 • 2.0GB • Updated 7 months ago