Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.
32M Pulls 133 Tags Updated 1 year ago
The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
16.3M Pulls 199 Tags Updated 1 year ago
Qwen2 is a new series of large language models from Alibaba group
5.9M Pulls 97 Tags Updated 1 year ago
Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.
2.2M Pulls 17 Tags Updated 1 year ago
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperforms the mathematical capabilities of open-source models and even closed-source models (e.g., GPT4o).
1M Pulls 52 Tags Updated 1 year ago
A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.
247.3K Pulls 5 Tags Updated 1 year ago
A custom model of Qwen2.5-coder:7B-Instruct with the Qwen2.5-coder:3B-instruct used as a speculative fill model to speed up inference. Primarily made for TabbyML Usage.
541 Pulls 1 Tag Updated 3 weeks ago
A Custom Qwen2.5-coder:14B-instruct model using a Qwen2.5-coder:3B-instruct model for Speculative Fill. Primary usage is for TabbyML.
262 Pulls 1 Tag Updated 3 weeks ago
A lightweight distilled Qwen2.5-based local model tuned for fast inference, general-purpose chat, and efficient on-device use. Good for everyday assistance, concise reasoning, and low-footprint deployments.
76 Pulls 1 Tag Updated 1 week ago
FROM ./Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf TEMPLATE """{{ .Prompt }}""" PARAMETER temperature 0.4 PARAMETER top_p 0.9 PARAMETER top_k 40 PARAMETER repeat_penalty 1.15 PARAMETER mirostat 2 PARAMETER mirostat_eta 0.2 PARAMETER mirostat_tau 5.0 PARAMETER
81 Pulls 1 Tag Updated 2 weeks ago
https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct
18.5K Pulls 1 Tag Updated 2 months ago
A Custom Qwen2.5-coder:3b-instruct using a Qwen2.5-coder:1.5b model as speculative fill. Primary usage for this model is TabbyML.
76 Pulls 1 Tag Updated 3 weeks ago
38 Pulls 1 Tag Updated 2 weeks ago
This model has been finetuned with data from Agentic Signal (a visual AI agent workflow automation platform with local LLM integration).
36 Pulls 1 Tag Updated 2 weeks ago
qwen2.5-7B-instruct-Q4_K_M
4,896 Pulls 1 Tag Updated 3 months ago
A concise, root-cause-first coding assistant built on Qwen2.5-Coder 7B. Debug and generate code with zero filler — runs entirely locally via Ollama.
26 Pulls 1 Tag Updated 1 week ago
Fully decensored Qwen2.5-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with 0.11 KL divergence — 97% censorship removal on a consumer RTX 4060. GGUF format, ready to run.
986 Pulls 1 Tag Updated 2 weeks ago
日语汉化翻译LLM
1,498 Pulls 3 Tags Updated 5 months ago
Fully decensored Qwen2.5-Coder-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with an exceptionally low KL divergence of 0.0163 — near-zero model degradation on a consumer RTX 4060.
777 Pulls 1 Tag Updated 2 weeks ago
NovaForge AI – Qwen 2.5-3B Optimized A CPU-optimized, lightweight, general-purpose AI model built on Qwen 2.5-3B, designed for fast and private local inference on low-resource systems.
1,068 Pulls 1 Tag Updated 5 months ago