2,438 9 months ago

Typhoon2 8B Instruct - 8B parameters Thai / English bilingual LLM.

tools

Models

View all →

Readme

Llama3.1-Typhoon2-8B: Thai Large Language Model (Instruct) - Q4_K_M quantized

Llama3.1-Typhoon2-8B-instruct is a instruct Thai 🇹🇭 large language model with 8 billion parameters, and it is based on Llama3.1-8B.

*To acknowledge Meta’s effort in creating the foundation model and to comply with the license, we explicitly include “llama-3.1” in the model name.

Run

ollama run scb10x/llama3.1-typhoon2-8b-instruct

Performance

Instruction-Following & Function Call Performance

Typhoon2 Llama 8B General Performance

Specific Domain Performance (Math & Coding)

TTyphoon2 Llama 8B Specific Domain Performance

Long Context Performance

Typhoon2 Llama 8B Long Context Performance

Detail Performance

Model IFEval - TH IFEval - EN MT-Bench TH MT-Bench EN Thai Code-Switching(t=0.7) Thai Code-Switching(t=1.0) FunctionCall-TH FunctionCall-EN GSM8K-TH GSM8K-EN MATH-TH MATH-EN HumanEval-TH HumanEval-EN MBPP-TH MBPP-EN
Llama3.1 8B Instruct 58.04% 77.64% 5.109 8.118 93% 11.2% 36.92% 66.06% 45.18% 62.4% 24.42% 48% 51.8% 67.7% 64.6% 66.9%
Typhoon2 Llama3 8B Instruct 72.60% 76.43% 5.7417 7.584 98.8% 98% 75.12% 79.08% 71.72% 81.0% 38.48% 49.04% 58.5% 68.9% 60.8% 63.0%

Link

Arxiv Huggingface