13 5 days ago

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. R5 was the first round to use production-quality data sources (Bespoke-Stratos, Tulu-3, SlimOrca) and achieved 84.2% on diverse eval — surpassing the base model.

tools thinking
ollama run robit/qwen3.5-9b-r5-research:q4km

Details

5 days ago

1df19b2a88ed · 5.6GB ·

qwen35
·
8.95B
·
Q4_K_M
{ "num_ctx": 131072, "stop": [ "<|im_end|>" ], "temperature": 0.6, "top_

Readme

r5_research_nutrition_label.png


Qwen3.5-9B R5 Research (Q4_K_M)

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. R5 was the first round to use production-quality data sources (Bespoke-Stratos, Tulu-3, SlimOrca) and achieved 84.2% on diverse eval — surpassing the base model. Superseded by R7 (86.8%).

Capabilities

  • Thinking — produces structured reasoning in <think> blocks
  • Tool calling — structured tool_calls via Ollama /api/chat
  • Instruction following — concise answers, format constraints, system prompt adherence

Eval Results

Benchmark Score
Diverse stochastic eval (38 tests) 84.2%
Base qwen3.5:9b on same eval 79.0%

Training

Quickstart

ollama run robit/qwen3.5-9b-r5-research:q4km

Parameters

  • RENDERER qwen3.5 + PARSER qwen3.5
  • temperature 0.6, top_p 0.95
  • stop "<|im_end|>"

Note

R5 is superseded by robit/qwen3.5-9b-r7-research:q4km which adds PrimeIntellect data and scores 86.8%.

License

Derived from Qwen3.5-9B (Apache 2.0). Training data licenses vary by source.