207 6 months ago

Light-R1: Surpassing R1-Distill from Scratch* with $1000 through Curriculum SFT & DPO

tools

6 months ago

f0591eee8e9c · 20GB

qwen2
·
32.8B
·
Q4_K_M
{{- if .Suffix }}<|fim_prefix|>{{ .Prompt }}<|fim_suffix|>{{ .Suffix }}<|fim_middle|> {{- else if .M

Readme