Llama-SEA-LION-v2-8B-IT is a multilingual model that has been pretrained and instruct-tuned for the Southeast Asia region. It was developed by AI Singapore and funded by the National Research Foundation, Singapore.

5 months ago

049de9d9b466 · 4.9GB

llama · 8.03B · Q4_K_M
{{ if .System}}<|begin_of_text|><|start_header_id|>system<|end_header_id|> {{ .System }}<|eot_id|>{{
META LLAMA 3 COMMUNITY LICENSE AGREEMENT Meta Llama 3 Version Release Date: April 18, 2024 “Agreem
{ "num_ctx": 4096, "stop": [ "<|start_header_id|>", "<|end_header_id|>",

Readme

Llama-SEA-LION-v2-8B-IT

SEA-LION is a collection of Large Language Models (LLMs) that have been pretrained and instruct-tuned for the Southeast Asia (SEA) region.

Llama-SEA-LION-v2-8B-IT is a multilingual model that has been fine-tuned on around 100,000 English instruction-completion pairs, alongside a smaller pool of around 50,000 instruction-completion pairs from other ASEAN languages such as Indonesian, Thai and Vietnamese. These instructions have been carefully curated and rewritten to ensure the model was trained on truly open, commercially permissive and high-quality datasets.

SEA-LION stands for Southeast Asian Languages In One Network.

  • Developed by: Products Pillar, AI Singapore
  • Funded by: Singapore NRF
  • Model type: Decoder
  • Languages: English, Indonesian, Thai, Vietnamese, Tamil
  • License: Llama 3 Community License

For more details, please refer to AI Singapore’s HuggingFace page for this model. The original GGUF files can be obtained from this HuggingFace repository.
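
As a rough usage sketch, the model can be queried through a locally running Ollama server over its HTTP API. The model tag `llama-sea-lion-v2-8b-it` below is a hypothetical placeholder (use whatever tag `ollama list` reports on your machine), and the endpoint assumes Ollama's default local port. The `num_ctx` value of 4096 mirrors the parameter shown in the model's parameter file above.

```python
import json
import urllib.request

# Hypothetical model tag; substitute the tag reported by `ollama list`.
MODEL_TAG = "llama-sea-lion-v2-8b-it"
# Assumes an Ollama server listening on its default local port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt: str, model: str = MODEL_TAG, num_ctx: int = 4096) -> dict:
    """Build the JSON payload for a single non-streaming generation."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,                  # ask for one complete JSON reply
        "options": {"num_ctx": num_ctx},  # context window, as in the parameter file
    }


def generate(prompt: str) -> str:
    """Send the prompt to the local Ollama server and return the reply text."""
    data = json.dumps(build_request(prompt)).encode("utf-8")
    req = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]


# Example call (requires a running Ollama server with the model pulled):
# print(generate("Apa ibu kota Indonesia?"))
```

Because the model is instruct-tuned for several languages, the prompt can be written in English, Indonesian, Thai, Vietnamese or Tamil without changing the request shape.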