This model is quantized with IQ4_XS and comes from https://huggingface.co/mradermacher/XiYanSQL-QwenCoder-32B-2504-GGUF. Running it requires 23.5 GB of GPU memory.
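As a rough sanity check on the 23.5 GB figure, the sketch below estimates how much of that memory the quantized weights alone would take. The parameter count (about 32 billion, inferred from the "32B" in the model name) and the bits-per-weight value (about 4.25, the figure commonly quoted for IQ4_XS) are assumptions and are not stated on this page; the remainder is presumably KV cache, activations, and runtime overhead.

```python
def estimate_weight_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate size of quantized weights in GB (1 GB = 10**9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumptions for illustration only; neither value appears on the model page.
ASSUMED_PARAMS = 32e9   # "32B" taken from the model name
ASSUMED_BPW = 4.25      # commonly quoted bits per weight for IQ4_XS

# Stated on the model page.
STATED_TOTAL_GB = 23.5

weights_gb = estimate_weight_gb(ASSUMED_PARAMS, ASSUMED_BPW)
headroom_gb = STATED_TOTAL_GB - weights_gb

print(f"Estimated weights: {weights_gb:.1f} GB")
print(f"Left for KV cache and overhead: {headroom_gb:.1f} GB")
```

Under these assumptions the weights account for roughly three quarters of the stated requirement. The actual split depends on context length and the runtime used.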