mannix/hermes-3-llama-3.1-70b:q2

Details

Updated 1 year ago

1 year ago

3051f44e85c7 · 26GB ·

model

archllama

parameters70.6B

quantizationQ2_K

26GB

template

{{ if .Messages }} {{- if or .System .Tools }}<|im_start|>system {{ .System }} {{- if .Tools }} You

1.8kB

license

LLAMA 3.1 COMMUNITY LICENSE AGREEMENT Llama 3.1 Version Release Date: July 23, 2024 “Agreement”

12kB

params

{ "stop": [ "<|im_start|>", "<|im_end|>", "<|eot_id|>", "<|begin

158B

Quantization from fp32
Using i-matrix calibration_datav3.txt
temperature set to 0.2: beware higher value will destroy reasoning and math capabilities of the model

Model Description

Hermes 3 is the latest version of our flagship Hermes series of LLMs by Nous Research.

For more details on new capabilities, training results, and more, see the Hermes 3 Technical Report.

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board.

The ethos of the Hermes series of models is focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user.

The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills.

Benchmarks

Hermes 3 is competitive, if not superior, to Llama-3.1 Instruct models at general capabilities, with varying strengths and weaknesses attributable between the two.

Full benchmark comparisons below:

Hermes 3 Llama-3.1 70b Model by NousResearch

Details

Readme