Experimental model that applies a second round of DPO training on top of Kunoichi-DPO-v2-7b, i.e. double-DPO.

7B


Readme

Source: https://huggingface.co/crestf411/daybreak-kunoichi-2dpo-7b

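For readers unfamiliar with DPO, the per-example objective can be sketched as below. This is a minimal illustration of the standard DPO loss, not the training code used for this model: the function name `dpo_loss`, the `beta=0.1` default, and the example log-probabilities are all assumptions, and the card does not state any hyperparameters. In a double-DPO setup the frozen reference would presumably be the already DPO-trained base (here Kunoichi-DPO-v2-7b), but that is also an assumption.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (hypothetical helper).

    Each argument is the summed log-probability of a full response under
    the trainable policy or the frozen reference model. `beta` is an
    assumed value; the model card does not give one.
    """
    # How much more the policy prefers "chosen" over "rejected".
    policy_margin = policy_chosen_logp - policy_rejected_logp
    # The same preference margin under the frozen reference model.
    ref_margin = ref_chosen_logp - ref_rejected_logp
    logits = beta * (policy_margin - ref_margin)
    # loss = -log(sigmoid(logits)), written in a numerically stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))


# At the start of a round the policy equals the reference, so the loss is log(2).
start = dpo_loss(-5.0, -7.0, -5.0, -7.0)
# A policy that widens the chosen/rejected margin gets a lower loss.
improved = dpo_loss(-4.0, -8.0, -5.0, -7.0)
```

The loss only rewards the policy for moving away from the reference in the preferred direction, so a second round measured against an already preference-tuned reference starts again from the same log(2) baseline.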