zhinao/light-r1:32b
5,058 Downloads · Updated 5 months ago
The first successful open-source RL attempt on already long-CoT fine-tuned models of similar sizes under a light budget. Light-R1-14B is also the state-of-the-art 14B math model, with AIME24 and AIME25 scores of 74.0 and 60.2, outperforming many 32B models.
Tags: 7b, 14b, 32b
Updated 6 months ago · 89040324e2ad · 20GB
model: arch qwen2 · parameters 32.8B · quantization Q4_K_M · 20GB
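As a rough sanity check on the figures above, the listed file size and parameter count imply an average storage cost per weight. The sketch below assumes that "20GB" means 20 × 10^9 bytes and that both numbers are rounded as displayed; the helper name `bits_per_weight` is illustrative, not part of any tool.

```python
def bits_per_weight(file_bytes: float, n_params: float) -> float:
    """Average number of bits stored per parameter for a model file."""
    return file_bytes * 8 / n_params


GB = 1e9  # assumption: decimal gigabytes, as typically shown on listing pages

# Figures taken from the listing: 20GB file, 32.8B parameters.
est = bits_per_weight(20 * GB, 32.8e9)
print(f"{est:.2f} bits per weight")  # prints "4.88 bits per weight"
```

The result is only an average: the file also holds metadata, and a mixed quantization scheme such as Q4_K_M does not necessarily store every tensor at the same precision.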
template · 395B
"{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slic
params · 148B
{ "stop": [ "<|begin▁of▁sentence|>", "<|end▁of▁sentence|>",
Readme
# Reference [Github](https://github.com/Qihoo360/Light-R1)
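For readers who have already pulled this tag, one possible way to query it is through Ollama's local HTTP API. This is a minimal sketch, not something taken from the model's own documentation: it assumes an Ollama server is running on the default local port 11434, and the helper names `build_request` and `generate` are hypothetical.

```python
import json
import urllib.request

# Assumption: a local Ollama server listening on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "zhinao/light-r1:32b"  # model tag as listed on this page


def build_request(prompt: str) -> urllib.request.Request:
    """Build a non-streaming generate request for the model."""
    payload = {"model": MODEL, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str) -> str:
    """Send the prompt to the server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("What is the sum of the first 10 positive integers?")` would return the model's full reply as a single string, provided the server is reachable and the tag has been downloaded.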