13 5 days ago

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. R5 was the first round to use production-quality data sources (Bespoke-Stratos, Tulu-3, SlimOrca) and achieved 84.2% on diverse eval — surpassing the base model.

tools thinking
ollama run robit/qwen3.5-9b-r5-research:q4km

Applications

Claude Code
Claude Code ollama launch claude --model robit/qwen3.5-9b-r5-research:q4km
Codex
Codex ollama launch codex --model robit/qwen3.5-9b-r5-research:q4km
OpenCode
OpenCode ollama launch opencode --model robit/qwen3.5-9b-r5-research:q4km
OpenClaw
OpenClaw ollama launch openclaw --model robit/qwen3.5-9b-r5-research:q4km

Models

View all →

Readme

r5_research_nutrition_label.png


Qwen3.5-9B R5 Research (Q4_K_M)

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. R5 was the first round to use production-quality data sources (Bespoke-Stratos, Tulu-3, SlimOrca) and achieved 84.2% on diverse eval — surpassing the base model. Superseded by R7 (86.8%).

Capabilities

  • Thinking — produces structured reasoning in <think> blocks
  • Tool calling — structured tool_calls via Ollama /api/chat
  • Instruction following — concise answers, format constraints, system prompt adherence

Eval Results

Benchmark Score
Diverse stochastic eval (38 tests) 84.2%
Base qwen3.5:9b on same eval 79.0%

Training

Quickstart

ollama run robit/qwen3.5-9b-r5-research:q4km

Parameters

  • RENDERER qwen3.5 + PARSER qwen3.5
  • temperature 0.6, top_p 0.95
  • stop "<|im_end|>"

Note

R5 is superseded by robit/qwen3.5-9b-r7-research:q4km which adds PrimeIntellect data and scores 86.8%.

License

Derived from Qwen3.5-9B (Apache 2.0). Training data licenses vary by source.