tickling tensors
-
qwen3.5-9b-r5-vision
Fine-tuned Qwen3.5-9B with distilled reasoning and full vision support. 883 tensors — vision tower preserved byte-for-byte from base. R5 was the first vision-capable distilled model.
vision tools thinking151 Pulls 1 Tag Updated 5 days ago
-
qwen3.5-9b-r7-research-vision
Fine-tuned Qwen3.5-9B with distilled reasoning and full vision support. 883 tensors (427 text + 441 vision + 15 MTP) — vision tower preserved byte-for-byte from base via llama-export-lora merge.
vision tools thinking82 Pulls 1 Tag Updated 3 days ago
-
qwen3.5-9b-r7-research
Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. Trained via LoRA SFT with an additive data strategy that preserves base model capabilities while improving instruction following and reasoning.
tools thinking36 Pulls 1 Tag Updated 3 days ago
-
qwen3.5-9b-r5-research
Fine-tuned Qwen3.5-9B with distilled reasoning from research-backed datasets. R5 was the first round to use production-quality data sources (Bespoke-Stratos, Tulu-3, SlimOrca) and achieved 84.2% on diverse eval — surpassing the base model.
tools thinking12 Pulls 1 Tag Updated 5 days ago