qwen2

qwen2.5

Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset of up to 18 trillion tokens. They support context lengths of up to 128K tokens and multiple languages.

tools 0.5b 1.5b 3b 7b 14b 32b 72b

32.1M Pulls 133 Tags Updated 1 year ago

qwen2.5-coder

The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

tools 0.5b 1.5b 3b 7b 14b 32b

16.4M Pulls 199 Tags Updated 1 year ago

qwen2

Qwen2 is a new series of large language models from Alibaba Group.

tools 0.5b 1.5b 7b 72b

5.9M Pulls 97 Tags Updated 1 year ago

qwen2.5vl

Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.

vision 3b 7b 32b 72b

2.3M Pulls 17 Tags Updated 1 year ago

qwen2-math

Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, whose mathematical capabilities significantly outperform those of open-source models and even closed-source models (e.g., GPT-4o).

1.5b 7b 72b

1M Pulls 52 Tags Updated 1 year ago

smallthinker

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.

3b

247.4K Pulls 5 Tags Updated 1 year ago

jsck5147/nexo

qwen2.5-coder:3b

tools

5 Pulls 1 Tag Updated yesterday

cieloforge/qwen2.5-coder-7b-instruct-spec

A custom build of Qwen2.5-coder:7B-Instruct that uses Qwen2.5-coder:3B-instruct as a speculative fill model to speed up inference. Made primarily for use with TabbyML.

tools

580 Pulls 1 Tag Updated 3 weeks ago

cieloforge/qwen2.5-14B-instruct-spec

A custom Qwen2.5-coder:14B-instruct model that uses a Qwen2.5-coder:3B-instruct model for speculative fill. Primarily for use with TabbyML.

tools

281 Pulls 1 Tag Updated 3 weeks ago

jarvis-tech/Jarvis_Distilled_v1

A lightweight distilled Qwen2.5-based local model tuned for fast inference, general-purpose chat, and efficient on-device use. Good for everyday assistance, concise reasoning, and low-footprint deployments.

tools

86 Pulls 1 Tag Updated 1 week ago

benjaminjodom45/qwen25coder3b

FROM ./Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf TEMPLATE """{{ .Prompt }}""" PARAMETER temperature 0.4 PARAMETER top_p 0.9 PARAMETER top_k 40 PARAMETER repeat_penalty 1.15 PARAMETER mirostat 2 PARAMETER mirostat_eta 0.2 PARAMETER mirostat_tau 5.0 PARAMETER

tools

89 Pulls 1 Tag Updated 2 weeks ago
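
The description of benjaminjodom45/qwen25coder3b is the opening of an Ollama Modelfile, cut off by the listing after the last PARAMETER keyword. As a rough sketch, its directives can be separated again by splitting on the three keywords that appear in it. The names split_directives and parameters, and the restriction to these three keywords, are assumptions made for illustration. This is not Ollama's own parser, and the truncated final directive is kept empty rather than guessed.

```python
import re

# The listing text, copied as-is; it ends mid-directive and is left that way.
DESCRIPTION = (
    'FROM ./Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf TEMPLATE """{{ .Prompt }}""" '
    "PARAMETER temperature 0.4 PARAMETER top_p 0.9 PARAMETER top_k 40 "
    "PARAMETER repeat_penalty 1.15 PARAMETER mirostat 2 "
    "PARAMETER mirostat_eta 0.2 PARAMETER mirostat_tau 5.0 PARAMETER"
)

# Assumption: only the three keywords seen in this description are handled.
KEYWORD = re.compile(r"\b(FROM|TEMPLATE|PARAMETER)\b")


def split_directives(text):
    """Return (keyword, argument) pairs in the order they appear."""
    parts = KEYWORD.split(text)
    # parts is [prefix, keyword, argument, keyword, argument, ...]
    return [(kw, arg.strip()) for kw, arg in zip(parts[1::2], parts[2::2])]


def parameters(directives):
    """Collect complete PARAMETER directives as a name -> value mapping."""
    params = {}
    for kw, arg in directives:
        fields = arg.split(None, 1)
        # A PARAMETER with no name/value pair (the truncated one) is skipped.
        if kw == "PARAMETER" and len(fields) == 2:
            params[fields[0]] = fields[1]
    return params


if __name__ == "__main__":
    for kw, arg in split_directives(DESCRIPTION):
        print(kw, arg)
```

Printing one directive per line makes the sampling settings easier to read than the single-line description, and the empty trailing PARAMETER shows where the listing cut the text off.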

miti99/gte-qwen2

https://huggingface.co/Alibaba-NLP/gte-Qwen2-1.5B-instruct

18.5K Pulls 1 Tag Updated 2 months ago

cieloforge/qwen2.5-coder-3b-instruct-spec

A custom Qwen2.5-coder:3b-instruct that uses a Qwen2.5-coder:1.5b model for speculative fill. Primarily for use with TabbyML.

tools

82 Pulls 1 Tag Updated 3 weeks ago

junquan2k/Qwen2.5-VL-7B-local9-0414

vision

49 Pulls 1 Tag Updated 2 weeks ago

code-forge-temple/agentic-signal-qwen2.5-coder

This model has been fine-tuned with data from Agentic Signal (a visual AI agent workflow automation platform with local LLM integration).

7b

38 Pulls 1 Tag Updated 3 weeks ago

qcwind/qwen2.5-7B-instruct-Q4_K_M

qwen2.5-7B-instruct-Q4_K_M

tools

4,961 Pulls 1 Tag Updated 3 months ago

sssonusharma1999/ankit-coder

A concise, root-cause-first coding assistant built on Qwen2.5-Coder 7B. Debug and generate code with zero filler — runs entirely locally via Ollama.

28 Pulls 1 Tag Updated 1 week ago

r4c3r/qwen2.5-3b-heretic

Fully decensored Qwen2.5-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with 0.11 KL divergence — 97% censorship removal on a consumer RTX 4060. GGUF format, ready to run.

1,011 Pulls 1 Tag Updated 2 weeks ago

quantumcookie/Sakura-qwen2.5-v1.0

LLM for translating and localizing Japanese into Chinese

1.5b 7b 14b

1,516 Pulls 3 Tags Updated 5 months ago

r4c3r/qwen2.5-coder-3b-heretic

Fully decensored Qwen2.5-Coder-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with an exceptionally low KL divergence of 0.0163 — near-zero model degradation on a consumer RTX 4060.

799 Pulls 1 Tag Updated 2 weeks ago
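
The two Heretic entries quote KL divergence (0.11 and 0.0163) as a measure of how far the processed model drifted from the original. As a small illustration of the quantity itself, the sketch below evaluates D_KL(P || Q) for two discrete distributions. It assumes the figure compares the original and modified models' next-token probabilities, which the listing does not confirm. The function name and the sample probabilities are hypothetical.

```python
import math


def kl_divergence(p, q):
    """D_KL(P || Q) in nats for two discrete distributions of equal length."""
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue  # terms with p_i = 0 contribute nothing
        if qi == 0.0:
            return math.inf  # P puts mass where Q has none
        total += pi * math.log(pi / qi)
    return total


if __name__ == "__main__":
    original = [0.70, 0.20, 0.10]  # hypothetical next-token probabilities
    modified = [0.65, 0.25, 0.10]  # hypothetical values after modification
    print(kl_divergence(original, modified))
```

Identical distributions give 0 and larger values mean the two distributions differ more, which is why the lower of the two quoted figures is described as less model degradation.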