Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
32.1M Pulls 133 Tags Updated 1 year ago
The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
16.4M Pulls 199 Tags Updated 1 year ago
Qwen2 is a new series of large language models from Alibaba group
5.9M Pulls 97 Tags Updated 1 year ago
Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.
2.3M Pulls 17 Tags Updated 1 year ago
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperforms the mathematical capabilities of open-source models and even closed-source models (e.g., GPT4o).
1M Pulls 52 Tags Updated 1 year ago
A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.
247.4K Pulls 5 Tags Updated 1 year ago
qwen2.5-coder:3b
5 Pulls 1 Tag Updated yesterday
A custom model of Qwen2.5-coder:7B-Instruct with the Qwen2.5-coder:3B-instruct used as a speculative fill model to speed up inference. Primarily made for TabbyML Usage.
580 Pulls 1 Tag Updated 3 weeks ago
A Custom Qwen2.5-coder:14B-instruct model using a Qwen2.5-coder:3B-instruct model for Speculative Fill. Primary usage is for TabbyML.
281 Pulls 1 Tag Updated 3 weeks ago
A lightweight distilled Qwen2.5-based local model tuned for fast inference, general-purpose chat, and efficient on-device use. Good for everyday assistance, concise reasoning, and low-footprint deployments.
86 Pulls 1 Tag Updated 1 week ago
FROM ./Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf TEMPLATE """{{ .Prompt }}""" PARAMETER temperature 0.4 PARAMETER top_p 0.9 PARAMETER top_k 40 PARAMETER repeat_penalty 1.15 PARAMETER mirostat 2 PARAMETER mirostat_eta 0.2 PARAMETER mirostat_tau 5.0 PARAMETER
89 Pulls 1 Tag Updated 2 weeks ago
https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct
18.5K Pulls 1 Tag Updated 2 months ago
A Custom Qwen2.5-coder:3b-instruct using a Qwen2.5-coder:1.5b model as speculative fill. Primary usage for this model is TabbyML.
82 Pulls 1 Tag Updated 3 weeks ago
49 Pulls 1 Tag Updated 2 weeks ago
This model has been finetuned with data from Agentic Signal (a visual AI agent workflow automation platform with local LLM integration).
38 Pulls 1 Tag Updated 3 weeks ago
qwen2.5-7B-instruct-Q4_K_M
4,961 Pulls 1 Tag Updated 3 months ago
A concise, root-cause-first coding assistant built on Qwen2.5-Coder 7B. Debug and generate code with zero filler — runs entirely locally via Ollama.
28 Pulls 1 Tag Updated 1 week ago
Fully decensored Qwen2.5-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with 0.11 KL divergence — 97% censorship removal on a consumer RTX 4060. GGUF format, ready to run.
1,011 Pulls 1 Tag Updated 2 weeks ago
日语汉化翻译LLM
1,516 Pulls 3 Tags Updated 5 months ago
Fully decensored Qwen2.5-Coder-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with an exceptionally low KL divergence of 0.0163 — near-zero model degradation on a consumer RTX 4060.
799 Pulls 1 Tag Updated 2 weeks ago