The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
561.6K Pulls 196 Tags Updated 12 days ago
Qwen 1.5 is a series of large language models by Alibaba Cloud spanning from 0.5B to 110B parameters
4.1M Pulls 379 Tags Updated 7 months ago
Qwen2 is a new series of large language models from Alibaba group
3.9M Pulls 97 Tags Updated 2 months ago
Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
2.1M Pulls 133 Tags Updated 2 months ago
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperforms the mathematical capabilities of open-source models and even closed-source models (e.g., GPT4o).
102.6K Pulls 52 Tags Updated 2 months ago
CodeQwen1.5 is a large language model pretrained on a large amount of code data.
116K Pulls 30 Tags Updated 5 months ago
MiniCPM-V surpasses proprietary models such as GPT-4V, Gemini Pro, Qwen-VL and Claude 3 in overall performance, and support multimodal conversation for over 30 languages.
38K Pulls 8 Tags Updated 5 months ago
14.5K Pulls 13 Tags Updated 5 months ago
4,744 Pulls 1 Tag Updated 4 months ago
Mixture-of-Experts model 57b
3,662 Pulls 18 Tags Updated 3 months ago
Qwen2.5 coder tools model can work with Cline (prev. Claude Dev). Update 0.5b, 1.5b, 3b, 7b, 14b, 32b coder models.
2,933 Pulls 15 Tags Updated 12 days ago
Qwen2 with tools enabled
1,523 Pulls 3 Tags Updated 3 months ago
Prompt helper for Stable Diffusion based on Qwen2-0.5B
976 Pulls 4 Tags Updated 4 months ago
Arcee-SuperNova-Medius is a 14B parameter language model developed by Arcee.ai, built on the Qwen2.5-14B-Instruct architecture.
641 Pulls 22 Tags Updated 6 weeks ago
This model is based on Qwen2-72b, Dolphin-2.9.2 has a variety of instruction, conversational, and coding skills. It also has initial agentic abilities and supports function calling. Dolphin is uncensored.
556 Pulls 20 Tags Updated 5 months ago
549 Pulls 24 Tags Updated 5 months ago
Qwen2.5 tools model can work with Cline (prev. Claude Dev).
546 Pulls 4 Tags Updated 7 weeks ago
Qwen 2.5 Coder 7b Instruct
540 Pulls 1 Tag Updated 2 months ago
Qwen2-Math is a series of specialized math language models built upon the Qwen2 LLMs
490 Pulls 21 Tags Updated 3 months ago
Initialized from Qwen2-7B, this model offers performance comparable to larger models while remaining efficient and fast. It's ideal for developers, researchers, and businesses seeking advanced AI solutions for function calling
449 Pulls 1 Tag Updated 4 months ago