c9c69d6def7a · 8.5GB
From neph1/llama-3.1-instruct-bellman-8b-swedish
Model Card for Bellman
This version of Bellman is finetuned from llama-3.1-instruct-8b. It’s finetuned for prompt question answering, based on a dataset created from Swedish Wikipedia, with many Sweden-centric questions. New since previous versions are questions from a translated code-feedback dataset, as well as a number of stories. It’s not great at generating stories, but better than previously.
Try out the Q8 version here: https://huggingface.co/spaces/neph1/bellman (cpu)
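Llama 3.1 Instruct finetunes are typically prompted with the Llama 3 chat template inherited from the base model; below is a minimal sketch of building such a prompt for a Swedish question. The template and the example system prompt are assumptions, not stated in this card:

```python
def format_llama3_prompt(system: str, user: str) -> str:
    """Build a Llama 3-style chat prompt (template assumed from the base model)."""
    return (
        "<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

# Example: a Sweden-centric question, matching the finetuning data.
prompt = format_llama3_prompt(
    "Du är Bellman, en hjälpsam assistent som svarar på svenska.",
    "Vad är Sveriges huvudstad?",
)
print(prompt)
```

The trailing assistant header leaves the model positioned to generate the answer.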
Model Details
Training run on 240724:
Step Training Loss Validation Loss
25 1.352200 1.034565
50 1.033600 1.009348
75 1.022400 0.996665
100 1.002900 0.988050
125 1.014600 0.981633
150 1.006300 0.975584
175 0.988800 0.970966
200 0.985300 0.967037
225 0.992400 0.964120
250 0.950000 0.962472
275 0.931000 0.960848
300 0.932000 0.958946 ← picked checkpoint
Training Parameters
per_device_train_batch_size = 4,
gradient_accumulation_steps = 16,
num_train_epochs=3,
warmup_steps = 5,
learning_rate = 3e-5,
logging_steps = 25,
optim = "adamw_8bit",
weight_decay = 0.01,
lr_scheduler_type = "linear",
seed = 3407,
per_device_eval_batch_size = 2,
eval_strategy="steps",
eval_accumulation_steps = 32,
eval_steps = 25,
eval_delay = 0,
save_strategy="steps",
save_steps=50,
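For reference, the parameter list above maps onto a Hugging Face/TRL-style arguments dict; here is a minimal sketch (single GPU assumed; the exact trainer class is not stated in the card):

```python
# Run configuration copied from the parameter list above,
# expressed as a TrainingArguments-style dict (trainer class assumed).
training_args = {
    "per_device_train_batch_size": 4,
    "gradient_accumulation_steps": 16,
    "num_train_epochs": 3,
    "warmup_steps": 5,
    "learning_rate": 3e-5,
    "logging_steps": 25,
    "optim": "adamw_8bit",
    "weight_decay": 0.01,
    "lr_scheduler_type": "linear",
    "seed": 3407,
    "per_device_eval_batch_size": 2,
    "eval_strategy": "steps",
    "eval_accumulation_steps": 32,
    "eval_steps": 25,
    "eval_delay": 0,
    "save_strategy": "steps",
    "save_steps": 50,
}

# Effective batch size per optimizer step (single GPU assumed):
effective_batch_size = (
    training_args["per_device_train_batch_size"]
    * training_args["gradient_accumulation_steps"]
)
print(effective_batch_size)  # 4 * 16 = 64
```

With eval_steps = 25, the effective batch of 64 means each row in the loss table above corresponds to 25 optimizer steps, i.e. 1600 training examples.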
Model Description
- Developed by: Me
- Funded by: Me
- Model type: Instruct
- Language(s) (NLP): Swedish
- License: llama-3.1
- Finetuned from model: Llama3.1 Instruct 8b
Model Card Contact
rickard@mindemia.com