A custom model of Qwen2.5-coder:7B-Instruct with the Qwen2.5-coder:3B-instruct used as a speculative fill model to speed up inference. Primarily made for TabbyML Usage.
257 Pulls 1 Tag Updated 2 weeks ago
A Custom Qwen2.5-coder:14B-instruct model using a Qwen2.5-coder:3B-instruct model for Speculative Fill. Primary usage is for TabbyML.
118 Pulls 1 Tag Updated 1 week ago
FROM ./Qwen2.5-Coder-3B-Instruct-Q4_K_M.gguf TEMPLATE """{{ .Prompt }}""" PARAMETER temperature 0.4 PARAMETER top_p 0.9 PARAMETER top_k 40 PARAMETER repeat_penalty 1.15 PARAMETER mirostat 2 PARAMETER mirostat_eta 0.2 PARAMETER mirostat_tau 5.0 PARAMETER
34 Pulls 1 Tag Updated 4 days ago
Fully decensored Qwen2.5-Coder-3B-Instruct processed with Heretic abliteration. Achieves 3/100 refusals with an exceptionally low KL divergence of 0.0163 — near-zero model degradation on a consumer RTX 4060.
690 Pulls 1 Tag Updated 5 days ago
https://huggingface.co/bartowski/Qwen2.5-Coder-3B-Instruct-abliterated-GGUF
1,608 Pulls 22 Tags Updated 1 year ago
196 Pulls 1 Tag Updated 1 year ago
This repo contains the instruction-tuned 3B Qwen2.5-Coder model in the GGUF Format: https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct-GGUF/tree/main
253 Pulls 1 Tag Updated 12 months ago
Adapted for Cline tool / Roo Code use in VS Code fused model , hybrid of DeepSeekR1 and Qwen2.5 coder, from FuseAI/FuseO1-DeepSeekR1-Qwen2.5-Coder-32B-Preview.
4,803 Pulls 2 Tags Updated 1 year ago
Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models available in [F16, q8_0, q6_K, q4_K_S]
3,621 Pulls 4 Tags Updated 1 year ago
Qwen2.5-Coder-32B-Instruct fine-tuned on a decontaminated version of the codeforces dataset.
749 Pulls 7 Tags Updated 1 year ago
Qwen2.5-Coder for Roo available in [F16, q8_0, q6_K, q4_K_S]
662 Pulls 4 Tags Updated 1 year ago
Qwen2.5 Coder 32B with the corrected 128k context
583 Pulls 1 Tag Updated 1 year ago
A Custom Model using Qwen2.5-coder:7B-instruct as a base, and adding my custom Qwen2.5-coder-3b-instuct-spec model which is a Qwen2.5-coder:3b-instruct model using qwen2.5-coder:1.5b model as speculative fill. VERY WIP...
68 Pulls 1 Tag Updated 1 week ago
14 Pulls 1 Tag Updated 2 months ago
Perfect size for 24GB GPUs!
604 Pulls 1 Tag Updated 1 year ago
Quantized version of Qwen2.5-32B optimized for tool usage with Cline / Roo Code and complex problem solving.
540 Pulls 1 Tag Updated 1 year ago