zhinao/light-r1:7b-q8

5,058 5 months ago

The first successful open-source RL attempt on already long-CoT fine-tuned models of similar sizes under a light budget. Light-R1-14B is also the state-of-the-art 14B math model, with AIME24 and AIME25 scores of 74.0 and 60.2, outperforming many 32B models.

7b 14b 32b

5 months ago

96144618264a · 8.1GB

qwen2 · 7.62B · Q8_0
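
As a rough sanity check on the figures above, the listed 8.1GB is about what 7.62B parameters would occupy at Q8_0. The sketch below assumes every weight is stored as Q8_0 (blocks of 32 int8 values plus one 16-bit scale, so 34 bytes per 32 weights) and that GB means 10^9 bytes. The function name `q8_0_size_gb` is made up for illustration, and a real file also carries metadata and may keep some tensors at higher precision.

```python
# Q8_0 stores weights in blocks of 32: 32 int8 values plus one 16-bit scale.
BLOCK_WEIGHTS = 32
BLOCK_BYTES = 32 * 1 + 2  # 34 bytes per block, i.e. 8.5 bits per weight


def q8_0_size_gb(n_params: float) -> float:
    """Approximate size in decimal GB, assuming every weight is Q8_0."""
    return n_params * BLOCK_BYTES / BLOCK_WEIGHTS / 1e9


# 7.62B is the parameter count shown on the page.
size = q8_0_size_gb(7.62e9)
print(f"{size:.1f} GB")  # prints "8.1 GB"
```

The estimate comes out to roughly 8.10 GB, consistent with the 8.1GB shown for this tag.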
"{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slic
{ "stop": [ "<|begin▁of▁sentence|>", "<|end▁of▁sentence|>",

Readme

Reference

Github