Light-R1 is the first successful open-source RL attempt on already long-CoT fine-tuned models of similar sizes under a light budget. Light-R1-14B is also the state-of-the-art 14B math model, scoring 74.0 on AIME24 and 60.2 on AIME25 and outperforming many 32B models.

Available sizes: 7b, 14b, 32b

ollama run zhinao/light-r1:7b-q8

curl http://localhost:11434/api/chat \
  -d '{
    "model": "zhinao/light-r1:7b-q8",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='zhinao/light-r1:7b-q8',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'zhinao/light-r1:7b-q8',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)
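
Because Light-R1 is a long-CoT model, a reply may carry its reasoning ahead of the final answer. The sketch below is a minimal Python post-processing helper, assuming the reasoning ends with a `</think>` tag and may or may not start with `<think>` (an assumption, this page does not state the delimiter). `split_reasoning` is a hypothetical helper name, not part of the ollama library.

```python
def split_reasoning(content: str) -> tuple[str, str]:
    """Split a raw reply into (reasoning, answer).

    Assumption: the chain of thought ends with "</think>". The opening
    "<think>" tag is optional, since some chat templates put it in the prompt.
    """
    head, sep, tail = content.partition("</think>")
    if not sep:
        # No closing tag found: treat the whole reply as the answer.
        return "", content.strip()
    reasoning = head.replace("<think>", "", 1).strip()
    return reasoning, tail.strip()


sample = "<think>2 + 2 is 4.</think>\nThe answer is 4."
reasoning, answer = split_reasoning(sample)
print(answer)
```

With the Python client, passing `response.message.content` to `split_reasoning` would return the two parts separately.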

Details

Updated 1 year ago

96144618264a · 8.1GB

model

arch: qwen2

parameters: 7.62B

quantization: Q8_0

8.1GB
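
The 8.1GB size is consistent with the listed parameter count and quantization. As a rough check in Python, the sketch below assumes every weight is stored in GGML's Q8_0 layout (blocks of 32 int8 values plus one 16-bit scale, so 34 bytes per 32 weights) and ignores file metadata and any tensors kept at higher precision. It is an estimate only.

```python
PARAMS = 7.62e9        # parameter count listed on this page
BLOCK_WEIGHTS = 32     # weights per Q8_0 block
BLOCK_BYTES = 32 + 2   # 32 int8 values plus one 16-bit scale


def q8_0_size_gb(n_params: float) -> float:
    """Estimate the model size in decimal gigabytes under pure Q8_0."""
    return n_params * BLOCK_BYTES / BLOCK_WEIGHTS / 1e9


estimate = q8_0_size_gb(PARAMS)
print(f"{estimate:.1f} GB")  # prints "8.1 GB"
```

This works out to about 8.5 bits per weight, which is why the download is slightly larger than one byte per parameter.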

template

"{{- if .System }}{{ .System }}{{ end }} {{- range $i, $_ := .Messages }} {{- $last := eq (len (slic

395B

params

{ "stop": [ "<｜begin▁of▁sentence｜>", "<｜end▁of▁sentence｜>",

148B
