ukjin/Qwen3-30B-A3B-Thinking-2507-Deepseek-v3.1-Distill:4b/params

ukjin/ Qwen3-30B-A3B-Thinking-2507-Deepseek-v3.1-Distill:4b

1,417 Downloads Updated 5 months ago

This model is a distilled version of Qwen/Qwen3-30B-A3B-Instruct designed to inherit the reasoning and behavioral characteristics of its much larger teacher model, deepseek-ai/DeepSeek-V3.1.

tools thinking 4b

Qwen3-30B-A3B-Thinking-2507-Deepseek-v3.1-Distill:4b ... /

params

cff3f395ef37 · 120B

{

"repeat_penalty": 1,

"stop": [

"<|im_start|>",

"<|im_end|>"

],

"temperature": 0.6,

"top_k": 20,

"top_p": 0.95

}