benedict/linkbricks-llama3.1-korean

benedict/ linkbricks-llama3.1-korean

2,754 Downloads Updated 1 year ago

NousResearch/Meta-Llama-3.1-8B-Instruct Korean finetuned model with SFT->RLHF->DPO

tools 8b 70b

ollama run benedict/linkbricks-llama3.1-korean:8b

curl http://localhost:11434/api/chat \
  -d '{
    "model": "benedict/linkbricks-llama3.1-korean:8b",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='benedict/linkbricks-llama3.1-korean:8b',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'benedict/linkbricks-llama3.1-korean:8b',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Applications

Claude Code

Claude Code ollama launch claude --model benedict/linkbricks-llama3.1-korean:8b

Codex App

Codex App ollama launch codex-app --model benedict/linkbricks-llama3.1-korean:8b

OpenClaw

OpenClaw ollama launch openclaw --model benedict/linkbricks-llama3.1-korean:8b

Hermes Agent

Hermes Agent ollama launch hermes --model benedict/linkbricks-llama3.1-korean:8b

Codex

Codex ollama launch codex --model benedict/linkbricks-llama3.1-korean:8b

OpenCode

OpenCode ollama launch opencode --model benedict/linkbricks-llama3.1-korean:8b

Models

Name

2 models

Size / Usage

Context

Input

linkbricks-llama3.1-korean:8b

8.5GB · 128K context window · Text · 1 year ago

linkbricks-llama3.1-korean:8b

8.5GB

128K

Text

linkbricks-llama3.1-korean:70b

50GB · 128K context window · Text · 1 year ago

linkbricks-llama3.1-korean:70b

50GB

128K

Text

Readme

AI 와 빅데이터 분석 전문 기업인 Linkbricks의 데이터사이언티스트인 지윤성 박사(Saxo)가 NousResearch/Meta-Llama-3.1-8B-Instruct 베이스모델을 KT-CLOUD상의 H100-80G 4개를 통해 SFT->RLHF->DPO 파인 튜닝을 한 한글 언어 모델로 한국어-중국어-영어-일본어 교차 학습 데이터와 로지컬 데이터를 통하여 한중일영 언어 교차 증강 처리와 복잡한 한글 논리 문제 역시 대응 가능하도록 훈련한 모델이며 토크나이저는 단어 확장 없이 베이스 모델 그대로 사용. 특히 고객 리뷰나 소셜 포스팅 고차원 분석 및 코딩등이 강화된 모델, 128k-Context Window, Tool Calling 지원 Deepspeed Stage=3, rslora, flash attention 2 를 사용

Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, fine-tuned the NousResearch/Meta-Llama-3.1-8B-Instruct base model with SFT->RLHF->DPO using four H100-80Gs on KT-CLOUD. It is a Korean language model trained to handle complex Korean logic problems through Korean-Chinese-English-Japanese cross-training data and logical data, and Tokenizer uses the base model without word expansion.

www.linkbricks.com, www.linkbricks.vc

![linkbricks.png](https://ollama.com/assets/benedict/linkbricks-llama3.1-korean/d1a1dcf2-4494-4b4a-837d-9f2041fd0874)

AI 와 빅데이터 분석 전문 기업인 Linkbricks의 데이터사이언티스트인 지윤성 박사(Saxo)가 NousResearch/Meta-Llama-3.1-8B-Instruct 베이스모델을 KT-CLOUD상의 H100-80G 4개를 통해 SFT->RLHF->DPO 파인 튜닝을 한
한글 언어 모델로 한국어-중국어-영어-일본어 교차 학습 데이터와 로지컬 데이터를 통하여 한중일영 언어 교차 증강 처리와 복잡한 한글 논리 문제 역시 대응 가능하도록 훈련한 모델이며 토크나이저는 단어 확장 없이 베이스 모델 그대로 사용. 
특히 고객 리뷰나 소셜 포스팅 고차원 분석 및 코딩등이 강화된 모델, 128k-Context Window, Tool Calling 지원 
Deepspeed Stage=3, rslora, flash attention 2 를 사용

Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, fine-tuned the NousResearch/Meta-Llama-3.1-8B-Instruct base model with SFT->RLHF->DPO using four H100-80Gs on KT-CLOUD.
It is a Korean language model trained to handle complex Korean logic problems through Korean-Chinese-English-Japanese cross-training data and logical data, and Tokenizer uses the base model without word expansion.

www.linkbricks.com, www.linkbricks.vc

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)