DeepSeek-R1-Distill-Qwen-32B

wao/DeepSeek-R1-Distill-Qwen-32B-Japanese

Japanese-tuned LLM by CyberAgent, fine-tuned from DeepSeek-R1-Distill-Qwen-32B.

275 Pulls 1 Tag Updated 12 months ago

hengwen/DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill models are fine-tuned from open-source models using samples generated by DeepSeek-R1. We slightly changed their configs and tokenizers. Please use our settings to run these models.

147.8K Pulls 2 Tags Updated 1 year ago

aia/DeepSeek-R1-Distill-Qwen-32B-Uncensored-i1

2,929 Pulls 1 Tag Updated 1 year ago

jerry0012000/DeepSeek-R1-Distill-Qwen-32B-Q4_K_M

717 Pulls 1 Tag Updated 1 year ago

bsahane/DeepSeek-R1-Distill-Qwen-32B

133 Pulls 1 Tag Updated 1 year ago

hellonico/DeepSeek-R1-Distill-Qwen-32B-Japanese

75 Pulls 1 Tag Updated 1 year ago

xitao/DeepSeek-R1-Distill-Qwen-32B-QMind

33 Pulls 2 Tags Updated 1 year ago

huihui_ai/deepseek-r1-Fusion

DeepSeek-R1-Distill-Qwen-Coder-32B-Fusion-9010 is a mixed model that combines the strengths of two powerful Qwen-based models: huihui-ai/DeepSeek-R1-Distill-Qwen-32B-abliterated and huihui-ai/Qwen2.5-Coder-32B-Instruct-abliterated.

32b

2,430 Pulls 6 Tags Updated 1 year ago