The first open-source successful RL attempt on already long-COT finetuned models of simialr sizes under light budget. Light-R1-14B is also the State-Of-The-Art 14B math model with AIME24 & 25 scores 74.0 & 60.2, outperforming many 32B models.
7b
14b
32b
2,487 Pulls Updated 2 weeks ago
8 Tags
cc9642ff5f94 • 4.7GB •
3 weeks ago
f948fbf72746 • 8.5GB •
4 weeks ago
89040324e2ad • 20GB •
3 weeks ago
5265fe47429b • 30GB •
2 weeks ago
a904f4de38d5 • 9.0GB •
4 weeks ago
2d011fa1ec95 • 16GB •
4 weeks ago
f23dc3e2b63e • 15GB •
2 weeks ago
96144618264a • 8.1GB •
3 weeks ago