84 3 weeks ago

Qwen3-Thinking-2507 is the continuation of Qwen3 thinking model, with improved quality and depth of reasoning. Qwen3-Instruct-2507 is the updated version of the previous Qwen3 non-thinking mode. (quantized UD-Q4_K_XL, thinking and instruct versions)

tools thinking 30b

Models

View all →

Readme

Feature Value
VLM false
think by version
tools true
speed 94 token/s

The characteristics are approximate, tested on rtx3090 24GB