AlphaMonarch-7B is a new DPO merge that retains all the reasoning abilities of the very best merges and significantly improves its conversational abilities.

7b

496 10 months ago

e6836092461f · 42B
[INST] {{ .System }} {{ .Prompt }} [/INST]