Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The models support a context length of up to 128K tokens and are multilingual.
26.8M Pulls 133 Tags Updated 1 year ago
Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.
4.9M Pulls 58 Tags Updated 4 days ago
Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.
1M Pulls 4 Tags Updated 1 month ago
The most powerful vision-language model in the Qwen model family to date.
2.9M Pulls 59 Tags Updated 5 months ago
The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.
488.3K Pulls 10 Tags Updated 3 months ago
Qwen3 is the latest generation of large language models in the Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.
25.9M Pulls 58 Tags Updated 5 months ago
Alibaba's performant long-context models for agentic and coding tasks.
4.4M Pulls 10 Tags Updated 6 months ago
Building upon the foundational models of the Qwen3 series, Qwen3 Embedding provides a comprehensive range of text embedding models in various sizes.
1.6M Pulls 12 Tags Updated 6 months ago
Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.
1.7M Pulls 17 Tags Updated 10 months ago
The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
13.6M Pulls 199 Tags Updated 10 months ago
Qwen2 is a new series of large language models from Alibaba Group.
5.5M Pulls 97 Tags Updated 1 year ago
Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 110B parameters
6.3M Pulls 379 Tags Updated 1 year ago
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperform open-source models and even closed-source models (e.g., GPT-4o) in mathematical capability.
864.3K Pulls 52 Tags Updated 1 year ago
A fine-tuned version of Deepseek-R1-Distilled-Qwen-1.5B that surpasses the performance of OpenAI’s o1-preview with just 1.5B parameters on popular math evaluations.
1.2M Pulls 5 Tags Updated 1 year ago
Qwen3-Coder-uncensored
1,021 Pulls 1 Tag Updated 2 months ago
Your endpoint management badass. Fine-tuned Qwen 3 8B for Microsoft Intune, PowerShell 7, DSC v3, Graph API, Entra ID, and security baselines.
10 Pulls 3 Tags Updated 33 minutes ago
A Quantized, Fine-Tuned Model for Enhanced Tool Calling, Code Generation, and Reasoning
1,157 Pulls 1 Tag Updated 2 months ago
Ultra long-context model supporting 1M tokens with uncensored outputs, ideal for analyzing entire books, codebases, and extensive documents.
1,184 Pulls 1 Tag Updated 4 months ago
244 Pulls 1 Tag Updated 1 month ago
Qwen3-Coder features the following key enhancements: significant performance, long-context capabilities, and agentic coding.
10K Pulls 9 Tags Updated 7 months ago