robit/qwen3.5-9b-r7-research:q4km/params

robit/ qwen3.5-9b-r7-research:q4km

127 Downloads Updated 3 months ago

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. Trained via LoRA SFT with an additive data strategy that preserves base model capabilities while improving instruction following and reasoning.

tools thinking

qwen3.5-9b-r7-research:q4km ... /

params

31879571b27f · 65B

{

"stop": [

"<|im_end|>"

],

"temperature": 0.6,

"top_p": 0.95

}