A 3B-parameter GPT-like model fine-tuned with Direct Preference Optimization (DPO) on a mix of publicly available datasets.
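For readers unfamiliar with DPO, the following is a minimal sketch of the standard DPO objective for a single preference pair. It is not this model's training code: the function name `dpo_loss`, the `beta` default of 0.1, and the example log-probabilities are all assumptions chosen for illustration, since the description does not state the hyperparameters or datasets used.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for one (chosen, rejected) pair.

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    `beta` is a hypothetical default, not a value taken from this model.
    """
    # How much more the policy favors each response than the reference does.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    logits = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(x)) equals log(1 + exp(-x)); computed in a stable form.
    return max(-logits, 0.0) + math.log1p(math.exp(-abs(logits)))


# Illustrative numbers only: the policy prefers the chosen response more
# strongly than the reference does, so the loss falls below log(2).
loss = dpo_loss(-10.0, -20.0, -15.0, -15.0)
```

The loss equals log(2) when the policy and reference agree exactly, and it decreases as the policy raises the chosen response's likelihood relative to the rejected one.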