2,054 3 weeks ago

Unsloth Dynamic 2.0 Quants achieves 1M tokens & superior accuracy & SOTA quantization performance. Select UD-IQ3_XXS for 16GB VRAM, UD-Q4_K_XL for 24GB VRAM, or UD-Q5_K_XL/UD-Q6_K_XL for 32GB VRAM.

tools

Models

View all →

Readme

Qwen3 Coder 30B model for consumer-grade graphics cards, Unsloth Dynamic 2.0 quantized version with 1M tokens. For graphics cards with 16GB of VRAM, the UD-IQ3_XXS is recommended. For graphics cards with 24GB of VRAM, the UD-Q4_K_XL is recommended. For graphics cards with 32GB of VRAM, the UD-Q5_K_XL is recommended. Remember using Environment Variable OLLAMA_CONTEXT_LENGTH to adjust context length. With 16GB of VRAM and UD-IQ3_XXS, the recommended context length is 8192.