466 1 year ago

Qwen2.5 Coder 32B with the corrected 128k context

tools
ollama run sammcj/qwen2.5-coder-32b-128k:q6_k

Applications

Claude Code
ollama launch claude --model sammcj/qwen2.5-coder-32b-128k:q6_k
Codex
ollama launch codex --model sammcj/qwen2.5-coder-32b-128k:q6_k
OpenCode
ollama launch opencode --model sammcj/qwen2.5-coder-32b-128k:q6_k
OpenClaw
ollama launch openclaw --model sammcj/qwen2.5-coder-32b-128k:q6_k

Readme

This model uses Unsloth’s GGUF, which fixes the context length (the official Ollama model sets it incorrectly).

https://huggingface.co/unsloth/Qwen2.5-Coder-32B-Instruct-128K-GGUF/blob/main/Qwen2.5-Coder-32B-Instruct-Q6_K.gguf
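
Because the point of this build is the longer context, it may help to request a context window explicitly when calling the model through Ollama's HTTP API. The following is a minimal sketch, not part of the original README: it assumes Ollama is serving on its default local address, and the `num_ctx` value of 131072 (128 × 1024) is an illustrative choice. The helper names `build_request` and `generate` are hypothetical.

```python
import json
import urllib.request

# Assumptions: Ollama is serving on its default local address, and the
# context size used below is illustrative, not a value from this README.
OLLAMA_URL = "http://localhost:11434/api/generate"
MODEL = "sammcj/qwen2.5-coder-32b-128k:q6_k"


def build_request(prompt: str, num_ctx: int = 131072) -> urllib.request.Request:
    """Build a non-streaming /api/generate request with an explicit context window."""
    payload = {
        "model": MODEL,
        "prompt": prompt,
        "stream": False,
        # num_ctx is the Ollama option that sets the context window in tokens.
        "options": {"num_ctx": num_ctx},
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt: str, num_ctx: int = 131072) -> str:
    """Send the request to the local Ollama server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt, num_ctx)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Summarise this file: ...")` would send the prompt with the full window requested. A larger `num_ctx` increases memory use, so a smaller value such as 32768 may be a more practical starting point on constrained hardware.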