qwen3-coder:30b-a3b-q8_0

497.2K downloads · Updated 3 weeks ago

Alibaba's performant long context models for agentic and coding tasks.

tools cloud 30b 480b

7b438a19895a · 32GB · qwen3moe · 30.5B · Q8_0
Apache License Version 2.0, January 2004 http://www.apache.org/licenses/ TERMS AND CONDITIONS FOR US
{ "repeat_penalty": 1.05, "stop": [ "<|im_start|>", "<|im_end|>", "<

Readme

Qwen3-Coder is the most agentic code model to date in the Qwen series.

Get started

480B

Cloud

ollama run qwen3-coder:480b-cloud

Local

ollama run qwen3-coder:480b

Running locally requires a minimum of 250GB of memory or unified memory.

30B

ollama run qwen3-coder:30b
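
Besides the interactive CLI, a pulled model can also be queried over Ollama's local HTTP API. The following is a minimal sketch, assuming an Ollama server is listening on its default address (http://localhost:11434) and that qwen3-coder:30b has already been pulled. The helper names build_chat_request and chat are illustrative only and are not part of any library.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/chat"  # assumed default local endpoint


def build_chat_request(prompt: str, model: str = "qwen3-coder:30b") -> dict:
    """Build the JSON body for a single-turn, non-streaming chat request."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "stream": False,  # ask for one complete JSON response instead of chunks
    }


def chat(prompt: str, model: str = "qwen3-coder:30b", url: str = OLLAMA_URL) -> str:
    """Send the prompt to a running Ollama server and return the reply text."""
    body = json.dumps(build_chat_request(prompt, model)).encode("utf-8")
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["message"]["content"]


# Example (requires a running server):
#   print(chat("Write a Python function that reverses a linked list."))
```

Keeping the payload construction separate from the network call makes it easy to inspect or log the request before sending it.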

Overview

qwen3-coder:30b is a mixture-of-experts model with 30B total parameters, of which only 3.3B are activated per token, delivering strong performance while remaining efficient.
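
As a rough sanity check on these figures, the arithmetic below relates the activated parameters to the total and estimates the size of the Q8_0 weights. It is a back-of-the-envelope sketch: it uses the 30.5B parameter count shown in the tag metadata, assumes every tensor is stored in Q8_0 (blocks of 32 8-bit weights plus one 16-bit scale, i.e. 34 bytes per 32 weights), and ignores the KV cache and runtime overhead. The function names are illustrative.

```python
TOTAL_PARAMS = 30.5e9   # parameter count shown in the tag metadata
ACTIVE_PARAMS = 3.3e9   # parameters activated per token, per the overview

# Q8_0 stores 32 int8 weights plus one 16-bit scale per block: 34 bytes / 32 weights.
Q8_0_BYTES_PER_WEIGHT = 34 / 32


def active_fraction(active: float, total: float) -> float:
    """Share of the parameters that participate in each forward pass."""
    return active / total


def weight_bytes(params: float, bytes_per_weight: float) -> float:
    """Approximate storage for the weights alone (assumes uniform quantization)."""
    return params * bytes_per_weight


print(f"active fraction: {active_fraction(ACTIVE_PARAMS, TOTAL_PARAMS):.1%}")  # 10.8%
print(f"Q8_0 weights: {weight_bytes(TOTAL_PARAMS, Q8_0_BYTES_PER_WEIGHT) / 1e9:.1f} GB")  # 32.4 GB
```

The estimate of about 32.4 GB is consistent with the 32GB size listed for this tag. Note that all 30.5B parameters must still be held in memory; only the compute per token scales with the activated subset.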

  • Exceptional agentic capabilities for real-world software engineering tasks through advanced long-horizon reinforcement learning on SWE-Bench and similar benchmarks.
  • Long context support with 256K tokens natively and up to 1M tokens using extrapolation methods, optimized for repository-scale understanding.
  • Scaled pretraining on 7.5T tokens (70% code ratio) while preserving strong general and mathematical abilities.
  • Execution-driven reinforcement learning that significantly boosts code execution success rates across diverse real-world coding tasks.
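
To make the 256K-token figure concrete, the sketch below estimates whether a set of source files would fit in the native context window. Several assumptions apply: "256K" is read as 262,144 tokens, token counts are approximated at about 4 characters per token (a rough heuristic, not the model's actual tokenizer), and the function names and the output reserve of 8,192 tokens are hypothetical choices for illustration.

```python
from pathlib import Path

NATIVE_CONTEXT_TOKENS = 256 * 1024  # assumption: "256K" read as 262,144 tokens
CHARS_PER_TOKEN = 4                 # rough heuristic, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Approximate token count from character length, rounded up."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(texts, reserve_for_output: int = 8192) -> bool:
    """Return True if the estimated prompt size leaves room for a reply."""
    total = sum(estimate_tokens(t) for t in texts)
    return total + reserve_for_output <= NATIVE_CONTEXT_TOKENS


def read_sources(root: str, suffix: str = ".py"):
    """Yield the text of every file under root with the given suffix."""
    for path in sorted(Path(root).rglob(f"*{suffix}")):
        yield path.read_text(encoding="utf-8", errors="ignore")


# Example: fits_in_context(read_sources("path/to/repo"))
```

For an accurate count, the heuristic would need to be replaced with the model's own tokenizer; the sketch is only meant to show the order of magnitude involved in repository-scale prompts.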

Reference