47 Downloads Updated 5 days ago
ollama run batiai/qwen3.5-35b:iq4
ollama launch claude --model batiai/qwen3.5-35b:iq4
ollama launch codex --model batiai/qwen3.5-35b:iq4
ollama launch opencode --model batiai/qwen3.5-35b:iq4
ollama launch openclaw --model batiai/qwen3.5-35b:iq4
Quantized from official Alibaba weights. Verified on real Mac hardware.
| Tag | Size | VRAM | M4 Max (128GB) | Use Case |
|---|---|---|---|---|
| iq4 | 17GB | 23GB | 26.6 t/s | 36GB+ Mac |
ollama run batiai/qwen3.5-35b:iq4
| 35B-A3B (MoE) | 27B (Dense) | |
|---|---|---|
| Total params | 35B | 27B |
| Active params | 3B | 27B |
| VRAM | 23GB | 28GB |
| Speed | 26.6 t/s | 17.0 t/s |
MoE only activates 3B params per token — 9x less compute than 27B Dense. Same quality, much faster.
| Your Mac RAM | IQ4 (17GB) |
|---|---|
| 16GB | ❌ |
| 32GB | ⚠️ Tight (23GB VRAM) |
| 36GB+ | ✅ Fits |
| 48GB+ | ✅ Fast |
| 128GB | 26.6 t/s |
| Model | Size | VRAM | Speed (M4 Max) | Min Mac |
|---|---|---|---|---|
| batiai/qwen3.5-9b:q4 | 5.2GB | ~8GB | 12.5 t/s | 16GB |
| batiai/qwen3.5-27b:iq4 | 14GB | 28GB | 17.0 t/s | 32GB |
| batiai/qwen3.5-35b:iq4 | 17GB | 23GB | 26.6 t/s | 36GB |
For 36GB+ Mac, the 35B MoE is the clear winner — faster and less VRAM than 27B.
Free, on-device AI automation for Mac. 5MB app, 100% local, unlimited.