dnotitia/
dna-r1:14b-q4_K_M

74 7 months ago

Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets.

14b

7 months ago

d3798a276509 · 9.1GB ·

phi3
·
14.7B
·
Q4_K_M
{{- if .System }}<|im_start|>system<|im_sep|>{{ .System }}<|im_end|>{{ end }} {{- range $i, $_ := .M
{ "num_predict": 4096, "stop": [ "<|im_end|>" ], "temperature": 0.1, "to

Readme

dna-r1-logo.png

We introduce DNA-R1, a specialized reasoning model optimized for Korean language based on Microsoft’s Phi-4. By applying large-scale reinforcement learning (RL) using the same methodology as DeepSeek-R1, we have significantly enhanced the model’s Korean reasoning capabilities. This model demonstrates deep understanding of Korean text and exhibits exceptional reasoning abilities across mathematics, coding, and general reasoning tasks.

dna-r1-pipeline.png

References