qwen

qwen3.6

Qwen3.6 delivers substantial upgrades in agentic coding and thinking preservation than previous Qwen models.

vision tools thinking 27b 35b

836K Pulls 22 Tags Updated 1 week ago

qwen3.5

Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

8.4M Pulls 58 Tags Updated 1 month ago

qwen3-coder-next

Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

tools cloud

1.2M Pulls 4 Tags Updated 2 months ago

qwen3-next

The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

tools thinking cloud 80b

536.5K Pulls 10 Tags Updated 4 months ago

qwen3-vl

The most powerful vision-language model in the Qwen model family to date.

vision tools thinking cloud 2b 4b 8b 30b 32b 235b

3.6M Pulls 59 Tags Updated 6 months ago

qwen3-coder

Alibaba's performant long context models for agentic and coding tasks.

tools cloud 30b 480b

5.2M Pulls 10 Tags Updated 7 months ago

qwen3-embedding

Building upon the foundational models of the Qwen3 series, Qwen3 Embedding provides a comprehensive range of text embeddings models in various sizes

embedding 0.6b 4b 8b

1.8M Pulls 12 Tags Updated 7 months ago

qwen2.5vl

Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.

vision 3b 7b 32b 72b

1.9M Pulls 17 Tags Updated 11 months ago

qwen3

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

28.3M Pulls 58 Tags Updated 6 months ago

qwen2.5

Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.

tools 0.5b 1.5b 3b 7b 14b 32b 72b

29.2M Pulls 133 Tags Updated 1 year ago

qwen2.5-coder

The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

tools 0.5b 1.5b 3b 7b 14b 32b

15.1M Pulls 199 Tags Updated 11 months ago

qwen2

Qwen2 is a new series of large language models from Alibaba group

tools 0.5b 1.5b 7b 72b

5.8M Pulls 97 Tags Updated 1 year ago

Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 110B parameters

0.5b 1.8b 4b 7b 14b 32b 72b 110b

6.6M Pulls 379 Tags Updated 2 years ago

qwen2-math

Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperforms the mathematical capabilities of open-source models and even closed-source models (e.g., GPT4o).

1.5b 7b 72b

1M Pulls 52 Tags Updated 1 year ago

qwq

QwQ is the reasoning model of the Qwen series.

tools 32b

2.2M Pulls 8 Tags Updated 1 year ago

deepscaler

A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.

1.5b

1.2M Pulls 5 Tags Updated 1 year ago

codeqwen

CodeQwen1.5 is a large language model pretrained on a large amount of code data.

7b

1M Pulls 30 Tags Updated 1 year ago

smallthinker

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.

3b

242.1K Pulls 5 Tags Updated 1 year ago

QwertyMcQwertz/MutualistLLM

MutualistLLM is a local model for deconstructing reactionary narratives and capitalist ideology, offering grounded, coherent responses rooted in anarchist and socialist theory.

36 Pulls 1 Tag Updated 3 months ago

mrthp/omnicoder2

2nd gen OmniCoder, fine-tuned from Qwen3.5-9B. Trains on assistant tokens only (unlike v1): no more repetition loops, stable tool-calling in long agentic sessions. With a better prompt in general.

tools thinking

205 Pulls 1 Tag Updated 4 days ago