scb10x/llama3.1-typhoon2-8b-instruct


6,525 Downloads · Updated 1 year ago

Typhoon2 8B Instruct: an 8B-parameter Thai/English bilingual LLM.

tools

ollama run scb10x/llama3.1-typhoon2-8b-instruct

curl http://localhost:11434/api/chat \
  -d '{
    "model": "scb10x/llama3.1-typhoon2-8b-instruct",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='scb10x/llama3.1-typhoon2-8b-instruct',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'scb10x/llama3.1-typhoon2-8b-instruct',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)
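The page tags this model with `tools`, and the README reports function-calling scores. A minimal sketch of a tool-enabled request to the same `/api/chat` endpoint, using only the Python standard library, might look like the following. The `get_weather` tool, its parameters, and the helper names are hypothetical, and the `tools` request field and `tool_calls` response field are assumed to follow Ollama's chat API; treat this as an illustration, not the model authors' recommended setup.

```python
import json
import urllib.request

MODEL = "scb10x/llama3.1-typhoon2-8b-instruct"
OLLAMA_CHAT_URL = "http://localhost:11434/api/chat"  # same endpoint as the curl example

# Hypothetical tool definition, used only for illustration.
GET_WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string", "description": "City name"}},
            "required": ["city"],
        },
    },
}


def build_tool_chat_payload(user_message, tools):
    """Build a non-streaming /api/chat request body that advertises tools."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": user_message}],
        "tools": tools,
        "stream": False,
    }


def extract_tool_calls(response_body):
    """Return (name, arguments) pairs from a chat response, or [] if none."""
    calls = response_body.get("message", {}).get("tool_calls") or []
    return [(c["function"]["name"], c["function"]["arguments"]) for c in calls]


def send_chat(payload):
    """POST the payload to a locally running Ollama server (not called here)."""
    request = urllib.request.Request(
        OLLAMA_CHAT_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a local Ollama server running, `extract_tool_calls(send_chat(build_tool_chat_payload("What is the weather in Bangkok?", [GET_WEATHER_TOOL])))` would list any tool calls the model chose to make. Whether it calls the tool at all depends on the prompt and the model.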

Applications

Claude Code

ollama launch claude --model scb10x/llama3.1-typhoon2-8b-instruct

OpenCode

ollama launch opencode --model scb10x/llama3.1-typhoon2-8b-instruct

Hermes Agent

ollama launch hermes --model scb10x/llama3.1-typhoon2-8b-instruct

OpenClaw

ollama launch openclaw --model scb10x/llama3.1-typhoon2-8b-instruct

Models

1 model

| Name | Size | Context | Input | Updated |
|------|------|---------|-------|---------|
| llama3.1-typhoon2-8b-instruct:latest | 4.9GB | 128K | Text | 1 year ago |
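The listing advertises a 128K context window, but an Ollama request typically runs with a smaller default context length unless one is requested. One way to ask for a larger window is the `num_ctx` entry in the request `options`. The sketch below assumes that "128K" means 131,072 tokens and that the host has enough memory for the chosen size; the helper name is hypothetical.

```python
MODEL = "scb10x/llama3.1-typhoon2-8b-instruct"
ADVERTISED_CONTEXT_TOKENS = 128 * 1024  # assumption: "128K" read as 131,072 tokens


def build_long_context_payload(prompt, num_ctx=ADVERTISED_CONTEXT_TOKENS):
    """Build an /api/chat body that asks Ollama for a specific context length."""
    if not 0 < num_ctx <= ADVERTISED_CONTEXT_TOKENS:
        raise ValueError("num_ctx must be between 1 and the advertised window")
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "options": {"num_ctx": num_ctx},  # context length in tokens
        "stream": False,
    }
```

Larger values of `num_ctx` increase memory use, so a smaller value such as `build_long_context_payload(text, num_ctx=32768)` may be a more practical starting point.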

Readme

**Llama3.1-Typhoon2-8B**: Thai Large Language Model (Instruct) - Q4_K_M quantized

**Llama3.1-Typhoon2-8B-instruct** is an instruction-tuned Thai 🇹🇭 large language model with 8 billion parameters, based on Llama3.1-8B.

By using this model, you agree to the OpenTyphoon Terms and Conditions and acknowledge the Privacy Notice: https://opentyphoon.ai/tac · https://opentyphoon.ai/privacy

*To acknowledge Meta's effort in creating the foundation model and to comply with the license, we explicitly include "llama-3.1" in the model name.

## **Run**
```
ollama run scb10x/llama3.1-typhoon2-8b-instruct
```

## **Performance**

**Instruction-Following & Function Call Performance**

<div align="center">
  <img src="https://storage.googleapis.com/typhoon-public/assets/typhoon2-text/llama7b_general.png" alt="Typhoon2 Llama 8B General Performance" width="100%" style="margin-left:auto; margin-right:auto; display:block"/>
</div>

**Specific Domain Performance (Math & Coding)**

<div align="center">
  <img src="https://storage.googleapis.com/typhoon-public/assets/typhoon2-text/llama7b_specific.png" alt="Typhoon2 Llama 8B Specific Domain Performance" width="100%" style="margin-left:auto; margin-right:auto; display:block"/>
</div>

**Long Context Performance**

<div align="center">
  <img src="https://storage.googleapis.com/typhoon-public/assets/typhoon2-text/llama7b_long.jpg" alt="Typhoon2 Llama 8B Long Context Performance" width="100%" style="margin-left:auto; margin-right:auto; display:block"/>
</div>

**Detailed Performance**

| Model                          | IFEval - TH | IFEval - EN | MT-Bench TH | MT-Bench EN | Thai Code-Switching(t=0.7) | Thai Code-Switching(t=1.0) | FunctionCall-TH     | FunctionCall-EN     | GSM8K-TH  | GSM8K-EN  | MATH-TH   | MATH-EN   | HumanEval-TH | HumanEval-EN | MBPP-TH   | MBPP-EN   |
|--------------------------------|-------------|-------------|-------------|-------------|--------------------------------|--------------------------------|-----------|-----------|-----------|-----------|-----------|-----------|-------------|-------------|-----------|-----------|
| **Llama3.1 8B Instruct**       | 58.04%      | **77.64%**  | 5.109       | **8.118**   | 93%                            | 11.2%                         | 36.92%    | 66.06%    | 45.18%    | 62.4%     | 24.42%    | 48%       | 51.8%       | 67.7%       | **64.6%**  | **66.9%**  |
| **Typhoon2 Llama3 8B Instruct**| **72.60%**  | 76.43%      | **5.7417**  | 7.584       | **98.8%**                      | **98%**                       | **75.12%** | **79.08%** | **71.72%** | **81.0%**  | **38.48%** | **49.04%** | **58.5%**    | **68.9%**    | 60.8%     | 63.0%     |
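To make the comparison easier to read, the per-benchmark differences can be computed directly from the two rows of the table. The sketch below copies the reported scores as-is and subtracts the Llama3.1 8B Instruct value from the Typhoon2 value; the function names are hypothetical, and the differences are simple subtractions, not significance tests.

```python
# Scores copied from the table: (Llama3.1 8B Instruct, Typhoon2 8B Instruct).
# MT-Bench entries are raw scores; all other entries are percentages.
SCORES = {
    "IFEval - TH": (58.04, 72.60),
    "IFEval - EN": (77.64, 76.43),
    "MT-Bench TH": (5.109, 5.7417),
    "MT-Bench EN": (8.118, 7.584),
    "Thai Code-Switching(t=0.7)": (93.0, 98.8),
    "Thai Code-Switching(t=1.0)": (11.2, 98.0),
    "FunctionCall-TH": (36.92, 75.12),
    "FunctionCall-EN": (66.06, 79.08),
    "GSM8K-TH": (45.18, 71.72),
    "GSM8K-EN": (62.4, 81.0),
    "MATH-TH": (24.42, 38.48),
    "MATH-EN": (48.0, 49.04),
    "HumanEval-TH": (51.8, 58.5),
    "HumanEval-EN": (67.7, 68.9),
    "MBPP-TH": (64.6, 60.8),
    "MBPP-EN": (66.9, 63.0),
}


def score_deltas(scores):
    """Return Typhoon2 minus Llama3.1 for each benchmark, rounded to 4 places."""
    return {name: round(typhoon - base, 4) for name, (base, typhoon) in scores.items()}


def regressions(scores):
    """Return the benchmarks where Typhoon2 scores below Llama3.1."""
    return [name for name, delta in score_deltas(scores).items() if delta < 0]


if __name__ == "__main__":
    for name, delta in score_deltas(SCORES).items():
        print(f"{name}: {delta:+.4f}")
    print("Lower than Llama3.1 on:", ", ".join(regressions(SCORES)))
```

On these numbers, Typhoon2 is ahead on every Thai benchmark except MBPP-TH and trails the base model on IFEval - EN, MT-Bench EN, and both MBPP columns, which matches the bold entries in the table.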

## **Links**

- [arXiv](https://arxiv.org/abs/2412.13702)
- [Hugging Face](https://huggingface.co/scb10x/llama3.1-typhoon2-8b-instruct)
