A 7B-parameter GPT-like model fine-tuned with DiscoPOP on a mix of publicly available, synthetic datasets.

26 Pulls Updated 3 months ago

2e5973480ff2 · 103B
{ "penalize_newline": false, "repeat_penalty": 1, "stop": [ "<|im_start|>", "<|im_end|>" ] }
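As a rough illustration of how these parameters might be used, the sketch below parses the JSON above and places it in the `options` field of a request body for Ollama's `/api/generate` endpoint. The model name `hypothetical-model` and the helper names `build_generate_payload` and `truncate_at_stop` are assumptions made for this example and do not come from the page. The request is only built, not sent. The two stop strings look like ChatML turn delimiters, and a `repeat_penalty` of 1 is generally understood to leave repetition unpenalized.

```python
import json

# Parameters exactly as listed on the model page.
PARAMS_JSON = (
    '{ "penalize_newline": false, "repeat_penalty": 1, '
    '"stop": [ "<|im_start|>", "<|im_end|>" ] }'
)


def build_generate_payload(model: str, prompt: str, params_json: str = PARAMS_JSON) -> dict:
    """Build a request body for /api/generate (hypothetical helper, not part of any library)."""
    options = json.loads(params_json)
    return {"model": model, "prompt": prompt, "options": options, "stream": False}


def truncate_at_stop(text: str, stops: list) -> str:
    """Cut text at the earliest stop sequence, imitating the effect of "stop" (illustrative only)."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    # "hypothetical-model" is a placeholder; substitute the actual model tag.
    payload = build_generate_payload("hypothetical-model", "Hello")
    print(json.dumps(payload, indent=2))
```

The payload could then be sent with any HTTP client to a locally running server. `truncate_at_stop` is included only to show what a stop sequence does to generated text; the server applies stop sequences itself.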