dnotitia/
dna:8b-instruct-q4_K

699 9 months ago

DNA 1.0 8B Instruct is a state-of-the-art (SOTA) bilingual language model specifically optimized for Korean language, while also maintaining strong English capabilities.

8b

9 months ago

3356d2c78b90 · 4.9GB ·

llama
·
8.03B
·
Q4_K_M
You are a helpful assistant, Dnotitia DNA.
{ "stop": [ "<|start_header_id|>", "<|end_header_id|>", "<|eot_id|>"
{{- if or .System }}<|start_header_id|>system<|end_header_id|> {{- if .System }} {{ .System }} {{- e

Readme

assets_dna-logo.png

DNA 1.0 8B Instruct is a state-of-the-art (SOTA) bilingual language model based on Llama architecture, specifically optimized for Korean language understanding and generation, while also maintaining strong English capabilities. The model was developed through a sophisticated process involving model merging via spherical linear interpolation (SLERP) with Llama 3.1 8B Instruct, and underwent knowledge distillation (KD) using Llama 3.1 405B as the teacher model. It was extensively trained through continual pre-training (CPT) with a high-quality Korean dataset. The training pipeline was completed with supervised fine-tuning (SFT) and direct preference optimization (DPO) to align with human preferences and enhance instruction-following abilities.

assets_training-procedure.png

References