qwen

qwen3-vl

The most powerful vision-language model in the Qwen model family to date.

vision tools cloud 2b 4b 8b 30b 32b 235b

798.7K Pulls 59 Tags Updated 1 month ago

qwen3-coder

Alibaba's performant long-context models for agentic and coding tasks.

tools cloud 30b 480b

1.3M Pulls 10 Tags Updated 2 months ago

qwen2.5

Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.

tools 0.5b 1.5b 3b 7b 14b 32b 72b

18.2M Pulls 133 Tags Updated 1 year ago

qwen3

Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

15.3M Pulls 58 Tags Updated 2 months ago

qwen2.5-coder

The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

tools 0.5b 1.5b 3b 7b 14b 32b

9.3M Pulls 199 Tags Updated 6 months ago

qwen

Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 110B parameters.

0.5b 1.8b 4b 7b 14b 32b 72b 110b

5.1M Pulls 379 Tags Updated 1 year ago

qwen2

Qwen2 is a new series of large language models from Alibaba Group.

tools 0.5b 1.5b 7b 72b

4.5M Pulls 97 Tags Updated 1 year ago

qwen2.5vl

The flagship vision-language model of Qwen and a significant leap from the previous Qwen2-VL.

vision 3b 7b 32b 72b

1.1M Pulls 17 Tags Updated 7 months ago

qwen3-next

The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

tools thinking cloud 80b

207.2K Pulls 10 Tags Updated 1 week ago

qwen2-math

Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperform open-source models and even closed-source models (e.g., GPT-4o) in mathematical capabilities.

1.5b 7b 72b

198.9K Pulls 52 Tags Updated 1 year ago

qwen3-embedding

Building upon the foundational models of the Qwen3 series, Qwen3 Embedding provides a comprehensive range of text embedding models in various sizes.

embedding 0.6b 4b 8b

194.8K Pulls 12 Tags Updated 2 months ago

qwq

QwQ is the reasoning model of the Qwen series.

tools 32b

1.9M Pulls 8 Tags Updated 9 months ago

deepscaler

A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.

1.5b

871K Pulls 5 Tags Updated 10 months ago

codeqwen

CodeQwen1.5 is a large language model pretrained on a large amount of code data.

7b

210.2K Pulls 30 Tags Updated 1 year ago

smallthinker

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.

3b

108K Pulls 5 Tags Updated 11 months ago

dengcao/Qwen3-Reranker-8B

Alibaba's text reranking model. Qwen3-Reranker-8B has the following features: Model Type: Text Reranking. Supported Languages: 100+ Languages. Number of Parameters: 8B. Context Length: 32k.

196.4K Pulls 5 Tags Updated 6 months ago

hengwen/DeepSeek-R1-Distill-Qwen-32B

DeepSeek-R1-Distill models are fine-tuned based on open-source models, using samples generated by DeepSeek-R1. We slightly change their configs and tokenizers. Please use our setting to run these models.

126K Pulls 2 Tags Updated 11 months ago

hellonico/Qwen-2.5-Math-7.6B-Instruct-Q6_K.gguf

119.1K Pulls 1 Tag Updated 10 months ago

myaniu/qwen2.5-1m

Qwen2.5-7B/14B-Instruct-1M

tools 7b 14b

116.8K Pulls 11 Tags Updated 10 months ago

huihui_ai/qwen3-abliterated

Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

tools thinking 0.6b 1.7b 4b 8b 14b 16b 30b 32b 235b

105.6K Pulls 74 Tags Updated 4 months ago