Experimental model doing a DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.

7B

152 Pulls Updated 3 months ago

b0eab53ce397 · 128B
You are a helpful, smart, kind, and efficient AI assistant. You always fulfill the user's requests to the best of your ability.