Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The models support context lengths of up to 128K tokens and are multilingual.
30.9M Pulls 133 Tags Updated 1 year ago
The latest series of code-specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.
15.8M Pulls 199 Tags Updated 12 months ago
Qwen2 is a new series of large language models from Alibaba Group.
5.9M Pulls 97 Tags Updated 1 year ago
Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.
2M Pulls 17 Tags Updated 1 year ago
Qwen2 Math is a series of specialized math language models built on the Qwen2 LLMs that significantly outperforms open-source models, and even closed-source models (e.g., GPT-4o), in mathematical capability.
1M Pulls 52 Tags Updated 1 year ago
A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.
245.9K Pulls 5 Tags Updated 1 year ago
FROM ./Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf
TEMPLATE """{{ .Prompt }}"""
PARAMETER temperature 0.4
PARAMETER top_p 0.9
PARAMETER top_k 40
PARAMETER repeat_penalty 1.15
PARAMETER mirostat 2
PARAMETER mirostat_eta 0.2
PARAMETER mirostat_tau 5.0
PARAMETER
21 Pulls 1 Tag Updated 2 days ago
This model has been finetuned with data from Agentic Signal (a visual AI agent workflow automation platform with local LLM integration).
18 Pulls 1 Tag Updated 6 days ago
14 Pulls 1 Tag Updated 4 days ago
A finetuned version of Qwen2.5:0.5B to help find more of what's interesting.
8 Pulls 2 Tags Updated 5 days ago
A custom Qwen2.5-coder:7B-Instruct model that uses Qwen2.5-coder:3B-instruct as a speculative fill model to speed up inference. Made primarily for use with TabbyML.
213 Pulls 1 Tag Updated 1 week ago
A custom Qwen2.5-coder:14B-instruct model that uses a Qwen2.5-coder:3B-instruct model for speculative fill. Intended primarily for use with TabbyML.
106 Pulls 1 Tag Updated 1 week ago
https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct
18.5K Pulls 1 Tag Updated 1 month ago
A custom model that uses Qwen2.5-coder:7B-instruct as a base and adds my custom Qwen2.5-coder-3b-instuct-spec model, which is a Qwen2.5-coder:3b-instruct model that uses the qwen2.5-coder:1.5b model as speculative fill. Very much a work in progress...
61 Pulls 1 Tag Updated 1 week ago
A custom Qwen2.5-coder:3b-instruct model that uses a Qwen2.5-coder:1.5b model as speculative fill. Intended primarily for use with TabbyML.
37 Pulls 1 Tag Updated 1 week ago
qwen2.5-7B-instruct-Q4_K_M
4,586 Pulls 1 Tag Updated 3 months ago
Fully decensored Qwen2.5-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with 0.11 KL divergence — 97% censorship removal on a consumer RTX 4060. GGUF format, ready to run.
798 Pulls 1 Tag Updated 3 days ago
An LLM for translating and localizing Japanese into Chinese.
1,362 Pulls 3 Tags Updated 4 months ago
Fully decensored Qwen2.5-Coder-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with an exceptionally low KL divergence of 0.0163 — near-zero model degradation on a consumer RTX 4060.
681 Pulls 1 Tag Updated 3 days ago
NovaForge AI – Qwen 2.5-3B Optimized: a CPU-optimized, lightweight, general-purpose AI model built on Qwen 2.5-3B, designed for fast and private local inference on low-resource systems.
1,028 Pulls 1 Tag Updated 4 months ago