542 1 month ago

Qwen3.5-35B-A3B APEX GGUF -- A Novel MoE-Aware Mixed-Precision Quantization Technique Brought to you by the LocalAI team -- the creators of LocalAI the open-source AI engine that runs any model - LLMs, vision, image - on any hardware.

vision tools thinking
ollama run fredrezones55/Qwen3.5-APEX

Applications

Claude Code
Claude Code ollama launch claude --model fredrezones55/Qwen3.5-APEX
OpenClaw
OpenClaw ollama launch openclaw --model fredrezones55/Qwen3.5-APEX
Hermes Agent
Hermes Agent ollama launch hermes --model fredrezones55/Qwen3.5-APEX
Codex
Codex ollama launch codex --model fredrezones55/Qwen3.5-APEX
OpenCode
OpenCode ollama launch opencode --model fredrezones55/Qwen3.5-APEX

Models

View all →

Readme

Optimized Qwen3.5:35B MoE model with full vision support with GGUF based model.

My pipeline needed qwen35moe patching, but gguf model blob is fully functioning with vision and tooling. Ollama will not stop finetunes from showing their full potential.