Qwen2.5 Coder 32B with the corrected 128k context

tools

333 2 months ago

Readme

This uses Unsloth’s GGUF which fixes the context length (the official Ollama model is wrong).

https://huggingface.co/unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF/blob/main/Qwen2.5-Coder-32B-Instruct-Q6_K.gguf