MHKetbi/DeepScaleR-1.5B-Preview/params

MHKetbi/

DeepScaleR-1.5B-Preview

DeepScaleR-1.5B-Preview is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B using distributed reinforcement learning (RL)

65 Pulls Updated 7 weeks ago

DeepScaleR-1.5B-Preview ... /

params

3df9ae758182 · 68B

{

"num_ctx": 131072,

"stop": [

"<｜end▁of▁sentence｜>"

]

}