AlphaMonarch-7B is a new DPO (Direct Preference Optimization) merge that retains the reasoning abilities of the very best merges while significantly improving conversational ability.

7b

496 10 months ago

8b2a9bbd29a3 · 45B
{
  "num_ctx": 8192,
  "stop": [
    "[INST]",
    "[/INST]"
  ]
}
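
The parameters above set an 8192-token context window and two stop sequences. As a rough illustration of what they mean in practice, the Python sketch below parses the same JSON, cuts a piece of generated text at the first stop sequence, and assembles a request body in the shape accepted by Ollama's `/api/generate` endpoint (`model`, `prompt`, `options`). The model tag `alphamonarch:7b`, the helper names `truncate_at_stop` and `build_request`, and the sample text are assumptions made for illustration; they do not come from this page. The server normally applies stop sequences itself, so the truncation helper only mimics that behaviour client-side.

```python
import json

# Parameters copied from the params block above.
PARAMS_JSON = """
{
  "num_ctx": 8192,
  "stop": ["[INST]", "[/INST]"]
}
"""


def truncate_at_stop(text: str, stop: list[str]) -> str:
    """Return text cut at the earliest occurrence of any stop sequence."""
    cut = len(text)
    for marker in stop:
        idx = text.find(marker)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


def build_request(model: str, prompt: str, params: dict) -> dict:
    """Build a request body for a generate call.

    The params dict is passed through as the "options" field.
    The model tag is supplied by the caller (hypothetical here).
    """
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": dict(params),
    }


params = json.loads(PARAMS_JSON)

# Hypothetical raw output that runs past the end of the assistant turn.
sample = "Paris is the capital of France. [INST] next question [/INST]"
print(repr(truncate_at_stop(sample, params["stop"])))

# Hypothetical model tag; substitute the tag actually used locally.
body = build_request("alphamonarch:7b", "What is the capital of France?", params)
print(json.dumps(body, indent=2))
```

Listing both `[INST]` and `[/INST]` as stops means generation ends as soon as the model starts to emit either instruction marker, which keeps it from writing a new user turn on its own.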