Trained with RLHF from Llama-3-70B-Instruct; scores highly on Arena-Hard-Auto.
386 Pulls Updated 3 months ago
b57fc3f5453e · 43GB
model
arch llama · parameters 70.6B · quantization Q4_K_M
43GB
params
{"num_keep":24,"stop":["\u003c|start_header_id|\u003e","\u003c|end_header_id|\u003e","\u003c|eot_id|
110B
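The stop strings in the params entry are JSON-escaped, so `\u003c` and `\u003e` stand for `<` and `>`. A minimal sketch of decoding the two entries that appear in full; the third entry is cut off in the listing and is left out rather than guessed, and the variable names are illustrative only:

```python
import json

# The two complete stop entries from the params line, escaped as they appear.
# The third entry is truncated in the listing and is deliberately omitted.
escaped = r'["\u003c|start_header_id|\u003e", "\u003c|end_header_id|\u003e"]'

# json.loads turns the \u003c / \u003e escapes back into "<" and ">".
stops = json.loads(escaped)
print(stops)
```

Decoded, these are the Llama 3 header tokens, which is why generation halts when the model starts to emit a new message header.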
template
{{ if .System }}<|start_header_id|>system<|end_header_id|>
{{ .System }}<|eot_id|>{{ end }}{{ if .P
255B
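As a rough illustration of what the visible part of the template does, the sketch below mimics only the `{{ if .System }} ... {{ end }}` branch in Python. The function name `render_system_block` is hypothetical, the whitespace follows the listing as shown (the real template may differ), and the truncated remainder of the template is not reconstructed:

```python
def render_system_block(system: str) -> str:
    """Mimic the visible system branch of the template (assumed whitespace)."""
    # An empty system prompt is falsy in a Go template `if`, so nothing is emitted.
    if not system:
        return ""
    # Header tokens wrap the role name; <|eot_id|> closes the system turn.
    return f"<|start_header_id|>system<|end_header_id|>\n{system}<|eot_id|>"


print(repr(render_system_block("Be brief.")))
print(repr(render_system_block("")))
```

With no system prompt the block is skipped entirely, so the prompt starts directly with whatever the rest of the template emits.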
license
META LLAMA 3 COMMUNITY LICENSE AGREEMENT
Meta Llama 3 Version Release Date: April 18, 2024
“Agree
12kB
Readme
GGUF source: https://huggingface.co/bullerwins/Athene-70B-GGUF
Original source: https://huggingface.co/Nexusflow/Athene-70B
Llama3-Athene-70B
We introduce Llama3-Athene-70B, an open-weights LLM trained through RLHF, starting from Llama-3-70B-Instruct. Athene-70B achieves a high score on Arena-Hard-Auto, a proxy benchmark for Chatbot Arena.
- Developed by: The Nexusflow Team (Evan Frick, Peter Jin, Tianle Li*, Karthik Ganesan, Jian Zhang, Jiantao Jiao and Banghua Zhu).
- Model type: Chat Model
- Finetuned from model: Llama-3-70B-Instruct.