36 3 days ago

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. Trained via LoRA SFT with an additive data strategy that preserves base model capabilities while improving instruction following and reasoning.

tools thinking
ollama run robit/qwen3.5-9b-r7-research:q4km

Details

3 days ago

e861d95d6639 · 5.6GB ·

qwen35
·
8.95B
·
Q4_K_M
{ "stop": [ "<|im_end|>" ], "temperature": 0.6, "top_p": 0.95 }

Readme

r7_research_nutrition_label.png


Qwen3.5-9B R7 Research (Q4_K_M)

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. Trained via LoRA SFT with an additive data strategy that preserves base model capabilities while improving instruction following and reasoning.

Capabilities

  • Thinking — produces structured reasoning in <think> blocks
  • Tool calling — structured tool_calls via Ollama /api/chat with tools parameter
  • Instruction following — concise answers, format constraints, system prompt adherence

Eval Results

Benchmark Score
Diverse stochastic eval (38 tests, 9 categories) 86.8%
Base qwen3.5:9b on same eval 79.0%

Training

Quickstart

ollama run robit/qwen3.5-9b-r7-research:q4km

Parameters

  • RENDERER qwen3.5 + PARSER qwen3.5 (enables tool calling)
  • temperature 0.6, top_p 0.95
  • stop "<|im_end|>"

License

Derived from Qwen3.5-9B (Apache 2.0). Training data licenses vary by source.