A 7B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets using DiscoPOP

35 5 months ago

2e5973480ff2 · 103B
{
"penalize_newline": false,
"repeat_penalty": 1,
"stop": [
"<|im_start|>",
"<|im_end|>"
]
}