asedmammad/lhk-dpo:v1-q5_K

asedmammad/ lhk-dpo:v1-q5_K_M

23 Downloads Updated 1 year ago

DPO finetuned model from FusionNet_7Bx2_MoE_14B

ollama run asedmammad/lhk-dpo:v1-q5_K_M

curl http://localhost:11434/api/chat \
  -d '{
    "model": "asedmammad/lhk-dpo:v1-q5_K_M",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='asedmammad/lhk-dpo:v1-q5_K_M',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'asedmammad/lhk-dpo:v1-q5_K_M',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

2aef458b7638 · 9.1GB ·

model

archllama

parameters12.9B

quantizationQ5_K_M

9.1GB

params

{ "stop": [ "[INST]", "[/INST]" ] }

30B

template

[INST] {{ .System }} {{ .Prompt }} [/INST]

42B

Readme

LHK_DPO_v1 is trained via Direct Preference Optimization(DPO) from https://huggingface.co/TomGrc/FusionNet_7Bx2_MoE_14B.

Original model is from https://huggingface.co/HanNayeoniee/LHK_DPO_v1