A 3B-parameter GPT-like model fine-tuned with Direct Preference Optimization (DPO) on a mix of publicly available datasets.

153 Pulls Updated 7 months ago

bf328696c54f · 34B
{ "stop": [ "<|im_end|>" ] }
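
The parameter file above sets a single stop sequence, `<|im_end|>`. As a rough illustration of what a stop sequence does, the sketch below cuts generated text at the first occurrence of any configured stop string. The helper name `truncate_at_stop` and the sample text are hypothetical, and the sample assumes a ChatML-style turn layout, which the page does not confirm. A real runtime normally applies stop sequences during decoding rather than as a post-processing step.

```python
import json

# Parameters copied verbatim from the listing above.
PARAMS = json.loads('{ "stop": [ "<|im_end|>" ] }')


def truncate_at_stop(text: str, stop: list[str]) -> str:
    """Return `text` cut at the earliest occurrence of any stop sequence.

    Hypothetical helper for illustration only; not part of any runtime's API.
    """
    cut = len(text)
    for seq in stop:
        if not seq:
            continue  # ignore empty stop strings, which would match at index 0
        idx = text.find(seq)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


# Assumed sample of raw model output that runs past the end of its turn.
raw = "Hello there!<|im_end|>\n<|im_start|>user\n"
print(truncate_at_stop(raw, PARAMS["stop"]))  # prints "Hello there!"
```

Text with no stop sequence in it is returned unchanged, so the helper is safe to apply to every completion.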