chand1012/ rocket:latest

329 1 year ago

A 3B parameter GPT-like model fine-tuned on a mix of publicly available datasets using DPO.

bf328696c54f · 34B
{
"stop": [
"<|im_end|>"
]
}