
This model is the IQ4_XS quantization of XiYanSQL-QwenCoder-32B-2504, taken from https://huggingface.co/mradermacher/XiYanSQL-QwenCoder-32B-2504-GGUF. Running it requires 23.5 GB of GPU memory.
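
As a rough pre-flight check, the 23.5 GB figure can be compared against the free memory on your GPU(s). The sketch below is illustrative only: the function name `fits_in_gpu_memory` and the `headroom_gb` parameter are hypothetical, and summing memory across several cards assumes your runtime can split the model between them.

```python
REQUIRED_GB = 23.5  # GPU memory figure quoted for this model


def fits_in_gpu_memory(free_gb_per_gpu, required_gb=REQUIRED_GB, headroom_gb=0.0):
    """Return True if total free GPU memory covers the requirement.

    free_gb_per_gpu: list of free memory per GPU, in GB.
    headroom_gb: optional extra margin (an assumption, not part of the quoted figure).
    """
    return sum(free_gb_per_gpu) >= required_gb + headroom_gb


if __name__ == "__main__":
    print(fits_in_gpu_memory([24.0]))        # one 24 GB card
    print(fits_in_gpu_memory([12.0, 12.0]))  # two 12 GB cards, if splitting is supported
    print(fits_in_gpu_memory([16.0]))        # one 16 GB card
```

In practice a 24 GB card leaves very little margin, so passing a non-zero `headroom_gb` gives a more conservative answer.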