Trained through RLHF based off Llama-3-70B-Instruct, high scores on Arena-Hard-Auto.

ollama run finalend/athene-70b:Q8_0

curl http://localhost:11434/api/chat \
  -d '{
    "model": "finalend/athene-70b:Q8_0",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='finalend/athene-70b:Q8_0',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'finalend/athene-70b:Q8_0',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Details

Updated 1 year ago

1 year ago

d982209906e2 · 75GB ·

model

archllama

parameters70.6B

quantizationQ8_0

75GB

license

META LLAMA 3 COMMUNITY LICENSE AGREEMENT Meta Llama 3 Version Release Date: April 18, 2024 “Agreem

12kB

params

{ "num_keep": 24, "stop": [ "<|start_header_id|>", "<|end_header_id|>",

110B

template

{{ if .System }}<|start_header_id|>system<|end_header_id|> {{ .System }}<|eot_id|>{{ end }}{{ if .Pr

255B

Readme

GGUF source: https://huggingface.co/bullerwins/Athene-70B-GGUF Original source: https://huggingface.co/Nexusflow/Athene-70B

Llama3-Athene-70B

We introduce Llama3-Athene-70B, an open-weights LLM trained through RLHF based off Llama-3-70B-Instruct. Athene-70B achieves a high score on Arena-Hard-Auto, a proxy benchmark for Chatbot Arena.

Developed by: The Nexusflow Team (Evan Frick, Peter Jin, Tianle Li*, Karthik Ganesan, Jian Zhang, Jiantao Jiao and Banghua Zhu).
Model type: Chat Model
Finetuned from model: Llama-3-70B-Instruct.

Blog: https://nexusflow.ai/blogs/athene