https://huggingface.co/bosonai/Higgs-Llama-3-70B

ollama run eramax/higgs-llama3-70b:iq2xs

curl http://localhost:11434/api/chat \
  -d '{
    "model": "eramax/higgs-llama3-70b:iq2xs",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='eramax/higgs-llama3-70b:iq2xs',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'eramax/higgs-llama3-70b:iq2xs',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 2 years ago

16b90c963fc5 · 21GB

model

arch llama

parameters 70.6B

quantization IQ2_XS

21GB

template

<|begin_of_text|><|start_header_id|>system<|end_header_id|> {{ .System }}<|eot_id|><|start_header_id

227B

params

{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"

96B
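
To illustrate what the `stop` parameter does, here is a minimal sketch that cuts generated text at the first stop sequence it finds. It uses only the three stop strings visible in the (truncated) params listing above; the helper name `truncate_at_stop` is hypothetical, and this is not how Ollama implements stopping internally, only a model of the observable effect.

```python
# The three stop strings visible in the truncated params listing.
# The full list on the model page may contain more entries.
STOP_SEQUENCES = ["<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"]


def truncate_at_stop(text, stops=STOP_SEQUENCES):
    """Return text up to (not including) the earliest stop sequence."""
    cut = len(text)
    for stop in stops:
        idx = text.find(stop)
        if idx != -1:
            cut = min(cut, idx)
    return text[:cut]


raw = "Hello there!<|eot_id|><|start_header_id|>user"
print(truncate_at_stop(raw))  # prints "Hello there!"
```

In practice the server applies the stop strings during generation, so client code normally never sees them.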

Readme

Higgs-Llama-3-70B

Higgs-Llama-3-70B is post-trained from meta-llama/Meta-Llama-3-70B and tuned specifically for role-playing, while remaining competitive in general-domain instruction following and reasoning.

We first perform supervised fine-tuning on our in-house instruction-following and chat datasets. We then construct preference pairs with a semi-automated pipeline that relies on both human labelers and our private LLMs, and conduct iterative preference optimization to align the model. During alignment, we adopt a special strategy to align the model’s behavior with the system message. Compared with other instruct models, Higgs models follow their roles more closely.
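
Since the model is aligned to follow the system message, a role can be assigned by placing the persona in a `system` turn ahead of the user turn. The following is a minimal sketch using the same `ollama` Python client shown at the top of this page; the helper names `build_roleplay_messages` and `ask_in_character`, and the persona text, are illustrative assumptions rather than part of the model release.

```python
MODEL = "eramax/higgs-llama3-70b:iq2xs"


def build_roleplay_messages(persona, user_text):
    """Build a chat history whose system turn defines the role to play."""
    return [
        {"role": "system", "content": persona},
        {"role": "user", "content": user_text},
    ]


def ask_in_character(persona, user_text, model=MODEL):
    """Send the role-play prompt to a local Ollama server and return the reply."""
    # Imported lazily so the message builder works without the client installed.
    from ollama import chat

    response = chat(model=model, messages=build_roleplay_messages(persona, user_text))
    return response.message.content


# Example (requires a running Ollama server with the model pulled):
# print(ask_in_character("You are a ship's navigator in 1805.", "Where are we headed?"))
```

Keeping the message construction separate from the network call makes it easy to inspect or log the exact prompt before sending it.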

See our release blog.