184 1 week ago

Fine-tuned Qwen3.5-9B with distilled reasoning and full vision support. The model has 883 tensors, and the vision tower is preserved byte-for-byte from the base model. R5 was the first vision-capable distilled model.

Capabilities: vision, tools, thinking
57dbcd7decbc · 82B
{
  "num_ctx": 131072,
  "stop": [
    "<|im_end|>"
  ],
  "temperature": 0.6,
  "top_p": 0.95
}
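
As a rough illustration of how these defaults could be applied per request, the sketch below builds a chat request body that passes the same values as runtime options. It assumes the model is served by a local Ollama instance on the default port and queried through its `/api/chat` endpoint. The model name `example-model:latest` and the helper names `build_chat_request` and `send_chat_request` are hypothetical placeholders, not taken from this page.

```python
import json
import urllib.request

# Parameter values copied from the params block above.
PARAMS = {
    "num_ctx": 131072,        # context window, in tokens
    "stop": ["<|im_end|>"],   # stop sequence ending a chat turn
    "temperature": 0.6,
    "top_p": 0.95,
}


def build_chat_request(model, prompt, params=PARAMS):
    """Build a request body for an Ollama-style /api/chat call (hypothetical helper)."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "options": dict(params),  # copy so the caller's dict is not shared
        "stream": False,          # ask for a single JSON response
    }


def send_chat_request(body, host="http://localhost:11434"):
    """POST the body to a locally running server (assumed to be Ollama)."""
    req = urllib.request.Request(
        f"{host}/api/chat",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # "example-model:latest" is a placeholder; substitute the real model tag.
    body = build_chat_request("example-model:latest", "Describe yourself in one sentence.")
    print(json.dumps(body, indent=2))
```

Running the script only prints the request body; calling `send_chat_request(body)` would additionally require a running server with the model pulled. Options sent this way override the model's stored defaults for that request only, so a smaller `num_ctx` can be passed when memory is limited.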