This model is an IQ4_XS quantization sourced from https://huggingface.co/mradermacher/XiYanSQL-QwenCoder-32B-2504-GGUF. Running it requires 23.5 GB of GPU memory.
213 pulls, 1 tag, updated 9 months ago.
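As a rough pre-flight check against the 23.5 GB figure, the sketch below filters GPUs by total memory. It rests on several assumptions that are not part of this page: an NVIDIA GPU with `nvidia-smi` on the PATH, and the 23.5 GB figure read conservatively as GiB. The function names `gpus_with_enough_memory` and `query_total_memory_mib` are hypothetical helpers made up for illustration.

```python
import subprocess

# Memory figure quoted on this page; ASSUMED here to mean GiB (the stricter reading).
REQUIRED_GIB = 23.5


def gpus_with_enough_memory(smi_output: str, required_gib: float = REQUIRED_GIB) -> list[int]:
    """Return the line indices of GPUs whose memory meets the requirement.

    `smi_output` is expected to hold one memory value in MiB per line,
    as printed by nvidia-smi with the csv,noheader,nounits format.
    """
    required_mib = required_gib * 1024
    fits = []
    for index, line in enumerate(smi_output.strip().splitlines()):
        if float(line.strip()) >= required_mib:
            fits.append(index)
    return fits


def query_total_memory_mib() -> str:
    """Ask nvidia-smi for each GPU's total memory in MiB (requires NVIDIA drivers)."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

Calling `gpus_with_enough_memory(query_total_memory_mib())` on a machine with NVIDIA drivers would list the GPUs that pass. Total memory is only an upper bound, since other processes may already hold part of it; querying `memory.free` instead gives a tighter check.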