Coding-optimized variants of the official Qwen3.5 MoE models — full vision capability retained, tuned for precise code generation via lower temperature. Based on Alibaba's Qwen3.5 distributed through the Ollama registry.

Details

Updated 3 months ago

3 months ago

ad2feedda0f2 · 81GB ·

model

archqwen35moe

parameters125B

quantizationQ4_K_M

81GB

license

Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US

11kB

params

{ "num_ctx": 131072, "temperature": 0.6, "top_k": 20, "top_p": 0.95 }

61B

template

13B

Qwen3.5-Coder

Coding-optimized variants of the official Qwen3.5 MoE models — full vision capability retained, tuned for precise code generation via lower temperature. Based on Alibaba’s Qwen3.5 distributed through the Ollama registry.

Two tags available:

Tag	Parameters	Active/Token	Context	Vision
`mdq100/qwen3.5-coder:35b`	36.0B	~3B	262,144	Yes
`mdq100/qwen3.5-coder:122b`	125.1B	~3B	262,144	Yes

Same architecture. Same quantization. Coding-optimized temperature across both.

What is this?

Qwen3.5 is a hybrid Mixture-of-Experts model from Alibaba’s Qwen team featuring a novel Gated DeltaNet + sparse MoE architecture. Despite large total parameter counts, only ~3B are activated per token, making inference efficient regardless of model size.

These Coder variants take the official Ollama Qwen3.5 models and tune their parameters for coding workloads — lower temperature for more deterministic output, presence penalty to reduce repetition. Vision capability is fully preserved.

Why Coder?

OpenCode and similar coding tools don’t support per-session parameter overrides — they use whatever is baked into the Ollama model. These variants provide coding-optimized defaults without sacrificing any model capability.

Use :35b for fast, capable coding on standard hardware
Use :122b for maximum reasoning depth on high-memory systems

Architecture

Property	`:35b`	`:122b`
Architecture	qwen35moe	qwen35moe
Total parameters	36.0B	125.1B
Active per token	~3B	~3B
Context length	262,144	262,144
Embedding length	2048	3072
Quantization	Q4_K_M	Q4_K_M
Source	Ollama registry	Ollama registry

The 122b model has a larger embedding dimension (3072 vs 2048), enabling richer per-token representations and stronger reasoning depth for complex problems.

Capabilities

Completion
Vision (image input supported)
Tool calling
Thinking (extended reasoning mode)

Usage

35B — Fast coding

ollama pull mdq100/qwen3.5-coder:35b
ollama run mdq100/qwen3.5-coder:35b

Parameters:

temperature:      0.6
top_p:            0.95
top_k:            20

122B — Maximum reasoning

ollama pull mdq100/qwen3.5-coder:122b
ollama run mdq100/qwen3.5-coder:122b

Parameters:

temperature:      0.6
top_p:            0.95
top_k:            20
num_ctx:          131072

OpenCode

Project-level (`:35b`):

{
  "model": "ollama/mdq100/qwen3.5-coder:35b"
}

Project-level (`:122b` for complex tasks):

{
  "model": "ollama/mdq100/qwen3.5-coder:122b"
}

Benchmarks

Scores from the base Qwen3.5 model (BF16, full precision). Q4_K_M quantization may show minor variance (~1-2%).

Coding

Benchmark	Score
SWE-bench Verified	69.2
LiveCodeBench v6	74.6
CodeForces Rating	2028
FullStackBench (en)	58.1
Terminal Bench 2	40.5

Knowledge & Reasoning

Benchmark	Score
MMLU-Pro	85.3
MMLU-Redux	93.3
GPQA Diamond	84.2
HLE w/ CoT	22.4

Instruction Following

Benchmark	Score
IFEval	91.9
IFBench	70.2
MultiChallenge	60.0

Long Context

Benchmark	Score
LongBench v2	59.0
AA-LCR	58.5

Vision

Benchmark	Score
MMMU	81.4
MathVision	83.9
MMBenchEN-DEV-v1.1	91.5

Comparison: Coder vs Flash-Coder

Property	`mdq100/qwen3.5-coder`	`mdq100/qwen3.5-flash:35b-code`
Source	Ollama registry (official)	Unsloth GGUF
Vision	Yes	No
Quantization	Standard Q4_K_M	Unsloth Dynamic 2.0
Ollama validated	Yes	No
Tags	`:35b`, `:122b`	`:35b`

Use mdq100/qwen3.5-coder when you need vision support or prefer Ollama-validated builds. Use mdq100/qwen3.5-flash:35b-code for better quantization quality without vision.

Credits

Original model: Qwen3.5 by Alibaba Qwen Team
Base distribution: Ollama registry
Coder variant (coding-optimized parameters): packaged for Ollama + OpenCode