mannix/smaug-llama3-8b

360 Downloads · Updated 2 years ago

This model was built by applying the Smaug recipe, which targets performance on real-world multi-turn conversations, to meta-llama/Meta-Llama-3-8B-Instruct.

ollama run mannix/smaug-llama3-8b

curl http://localhost:11434/api/chat \
  -d '{
    "model": "mannix/smaug-llama3-8b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='mannix/smaug-llama3-8b',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'mannix/smaug-llama3-8b',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)
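Because the model is tuned for multi-turn conversations, a follow-up request should carry the earlier turns in `messages`. The sketch below shows one way to keep that history; it is not an official client pattern. `run_conversation` and `ollama_reply` are hypothetical helper names, and `ollama_reply` assumes the `ollama` Python package is installed and a local Ollama server is running.

```python
from typing import Callable, Dict, List

Message = Dict[str, str]


def run_conversation(
    reply_fn: Callable[[str, List[Message]], str],
    model: str,
    user_turns: List[str],
) -> List[Message]:
    """Send each user turn together with the full history; return the transcript."""
    messages: List[Message] = []
    for text in user_turns:
        messages.append({"role": "user", "content": text})
        # Pass a copy so reply_fn cannot mutate the running history.
        answer = reply_fn(model, list(messages))
        messages.append({"role": "assistant", "content": answer})
    return messages


def ollama_reply(model: str, messages: List[Message]) -> str:
    """Hypothetical adapter around ollama.chat (needs a running Ollama server)."""
    from ollama import chat  # imported lazily so run_conversation has no dependency

    return chat(model=model, messages=messages).message.content
```

Calling `run_conversation(ollama_reply, 'mannix/smaug-llama3-8b', ['Hello!', 'Can you say that more briefly?'])` would issue two requests, the second one including the first exchange.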

Models

22 models

| Name | Size | Context | Input | Updated |
| :---- | ---: | ------: | :---- | :------ |
| smaug-llama3-8b:latest | 4.7GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q2_k | 3.2GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q3_k_s | 3.7GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q3_k_m | 4.0GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q3_k_l | 4.3GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q4_0 | 4.7GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q4_1 | 5.1GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q4_k_s | 4.7GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q4_k_m | 4.9GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q5_0 | 5.6GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q5_1 | 6.1GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q5_k_s | 5.6GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q5_k_m | 5.7GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q6_k | 6.6GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:q8_0 | 8.5GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq2_xxs | 2.4GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq2_xs | 2.6GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq2_s | 2.8GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq3_xxs | 3.3GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq3_s | 3.7GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq4_xs | 4.4GB | 8K | Text | 2 years ago |
| smaug-llama3-8b:iq4_nl | 4.7GB | 8K | Text | 2 years ago |
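The sizes listed above can be used to shortlist a tag for a given download or disk budget. The following is a minimal sketch under that assumption: `largest_tag_within` is a hypothetical helper, not part of Ollama, and the listed file size is only a lower bound on the memory needed at run time, since the context cache adds to it.

```python
from typing import Optional

# Download sizes in GB, copied from the model list above.
# "latest" is left out so the helper always returns an explicit quantization tag.
TAG_SIZES_GB = {
    "q2_k": 3.2, "q3_k_s": 3.7, "q3_k_m": 4.0, "q3_k_l": 4.3,
    "q4_0": 4.7, "q4_1": 5.1, "q4_k_s": 4.7, "q4_k_m": 4.9,
    "q5_0": 5.6, "q5_1": 6.1, "q5_k_s": 5.6, "q5_k_m": 5.7,
    "q6_k": 6.6, "q8_0": 8.5,
    "iq2_xxs": 2.4, "iq2_xs": 2.6, "iq2_s": 2.8,
    "iq3_xxs": 3.3, "iq3_s": 3.7, "iq4_xs": 4.4, "iq4_nl": 4.7,
}


def largest_tag_within(budget_gb: float) -> Optional[str]:
    """Return the largest listed tag whose file size fits the budget, if any."""
    fitting = {tag: size for tag, size in TAG_SIZES_GB.items() if size <= budget_gb}
    if not fitting:
        return None
    # On equal sizes, max() keeps the first tag in the dictionary order above.
    return max(fitting, key=fitting.get)


tag = largest_tag_within(5.0)
if tag is not None:
    print(f"ollama run mannix/smaug-llama3-8b:{tag}")
```

Picking by file size alone ignores quality differences between quantization schemes of similar size, so the result is a starting point rather than a recommendation.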

Readme

# Llama-3-Smaug-8B
Quantizations with i-matrix `groups_merged.txt`, safetensors converted to fp32

### Built with Meta Llama 3

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64c14f95cac5f9ba52bbcd7f/OrcJyTaUtD2HxJOPPwNva.png)

This model was built by applying the Smaug recipe, which targets performance on real-world multi-turn conversations, to [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).

### Model Description

- **Developed by:** [Abacus.AI](https://abacus.ai)
- **License:** https://llama.meta.com/llama3/license/
- **Finetuned from model:** [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).

## Evaluation

### MT-Bench

```
########## First turn ##########
                                 score
model                    turn
Llama-3-Smaug-8B         1     8.77500
Meta-Llama-3-8B-Instruct 1     8.31250
########## Second turn ##########
                                 score
model                    turn
Meta-Llama-3-8B-Instruct 2      7.8875
Llama-3-Smaug-8B         2      7.8875
########## Average ##########
                             score
model
Llama-3-Smaug-8B          8.331250
Meta-Llama-3-8B-Instruct  8.10
```

| Model | First turn | Second turn | Average |
| :---- | ---------: | ----------: | ------: |
| Llama-3-Smaug-8B | 8.78 | 7.89 | 8.33 |
| Meta-Llama-3-8B-Instruct | 8.31 | 7.89 | 8.10 |
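The averages reported above are consistent with a plain mean of the two per-turn scores. A short check, using the unrounded values from the raw output (`average_scores` is a hypothetical helper written for this illustration, not part of the MT-Bench tooling):

```python
from statistics import mean

# Unrounded per-turn scores (first turn, second turn) from the raw MT-Bench output.
TURN_SCORES = {
    "Llama-3-Smaug-8B": (8.775, 7.8875),
    "Meta-Llama-3-8B-Instruct": (8.3125, 7.8875),
}


def average_scores(turn_scores):
    """Average the per-turn scores for each model."""
    return {model: mean(scores) for model, scores in turn_scores.items()}


averages = average_scores(TURN_SCORES)
for model, avg in averages.items():
    print(f"{model}: {avg:.2f}")
```

Both models score the same on the second turn, so the difference in the average comes entirely from the first turn.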

This version of Smaug uses new techniques and new data compared to [Smaug-72B](https://huggingface.co/abacusai/Smaug-72B-v0.1), and more information will be released later on. For now, see the previous Smaug paper: https://arxiv.org/abs/2402.13228.
