chand1012/
rocket:latest

323 1 year ago

A 3B parameter GPT-like model fine-tuned on a mix of publicly available datasets using DPO.

bf328696c54f · 34B
{
"stop": [
"<|im_end|>"
]
}