AlphaMonarch-7B is a new DPO merge that retains all the reasoning abilities of the very best merges and significantly improves its conversational abilities.

7b

481 9 months ago

3b1bc934d80a · 74B
{
"num_ctx": 8192,
"stop": [
"<|im_start|>",
"<|im_end|>"
]
}