基于 DeepSeek-R1-Distill-Qwen-1.5B 微调的中文轻量对话模型,自带猫娘口癖与亲昵风格。
398 Pulls 1 Tag Updated 8 months ago
DeepSeek-R1-Distill-Qwen-1.5B
4,728 Pulls 1 Tag Updated 1 year ago
1,250 Pulls 5 Tags Updated 1 year ago
382 Pulls 1 Tag Updated 1 year ago
DeepScaleR-1.5B-Preview is a language model fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B using distributed reinforcement learning (RL)
108 Pulls 1 Tag Updated 1 year ago
82 Pulls 1 Tag Updated 1 year ago
1.5b model
74 Pulls 1 Tag Updated 1 year ago
Brainstorm 40x by DavidAU available in [F16, q8_0, q6_K, q4_K_S]
262 Pulls 4 Tags Updated 1 year ago