-
qwen2.5
Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
tools 0.5b 1.5b 3b 7b 14b 32b 72b21.5M Pulls 133 Tags Updated 1 year ago
-
qwen3
Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b19.3M Pulls 58 Tags Updated 4 months ago
-
qwen2.5-coder
The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
tools 0.5b 1.5b 3b 7b 14b 32b11.1M Pulls 199 Tags Updated 8 months ago
-
qwen
Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 110B parameters
0.5b 1.8b 4b 7b 14b 32b 72b 110b5.5M Pulls 379 Tags Updated 1 year ago
-
qwen2
Qwen2 is a new series of large language models from Alibaba group
tools 0.5b 1.5b 7b 72b4.9M Pulls 97 Tags Updated 1 year ago
-
qwen3-coder
Alibaba's performant long context models for agentic and coding tasks.
tools cloud 30b 480b3M Pulls 10 Tags Updated 4 months ago
-
qwen3-vl
The most powerful vision-language model in the Qwen model family to date.
vision tools thinking cloud 2b 4b 8b 30b 32b 235b1.5M Pulls 59 Tags Updated 3 months ago
-
qwen2.5vl
Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.
vision 3b 7b 32b 72b1.3M Pulls 17 Tags Updated 9 months ago
-
qwen3-embedding
Building upon the foundational models of the Qwen3 series, Qwen3 Embedding provides a comprehensive range of text embeddings models in various sizes
embedding 0.6b 4b 8b797.6K Pulls 12 Tags Updated 4 months ago
-
codeqwen
CodeQwen1.5 is a large language model pretrained on a large amount of code data.
7b441.6K Pulls 30 Tags Updated 1 year ago
-
qwen2-math
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperforms the mathematical capabilities of open-source models and even closed-source models (e.g., GPT4o).
1.5b 7b 72b429.4K Pulls 52 Tags Updated 1 year ago
-
qwen3-next
The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.
tools thinking cloud 80b331.5K Pulls 10 Tags Updated 2 months ago
-
qwen3-coder-next
Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.
tools cloud88.4K Pulls 4 Tags Updated 1 week ago
-
qwen3.5
The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.
vision tools thinking cloud6,758 Pulls 2 Tags Updated 18 hours ago