15 6 days ago

Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. R5 was the first round to use production-quality data sources (Bespoke-Stratos, Tulu-3, SlimOrca) and achieved 84.2% on diverse eval — surpassing the base model.

tools thinking
57dbcd7decbc · 82B
{
"num_ctx": 131072,
"stop": [
"<|im_end|>"
],
"temperature": 0.6,
"top_p": 0.95
}