
This model is the IQ4_XS quantization from https://huggingface.co/mradermacher/XiYanSQL-QwenCoder-32B-2504-GGUF. Running it requires 23.5 GB of GPU memory.

b0816db97fde · 57B
{
  "num_ctx": 32768,
  "num_gpu": 75,
  "stop": [
    "--end-of-sql--"
  ]
}
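
As a rough illustration of how these parameters could be supplied per request, the sketch below builds a payload for Ollama's `/api/generate` endpoint using only the Python standard library. In Ollama, `num_ctx` sets the context window in tokens, `num_gpu` sets how many layers are offloaded to the GPU, and `stop` lists the sequences that end generation. The model name `xiyansql-32b` and the `localhost:11434` address are assumptions (the page does not state the model tag), and the function names are hypothetical.

```python
import json
import urllib.request

# Parameters copied from the params file above.
PARAMS = {
    "num_ctx": 32768,             # context window, in tokens
    "num_gpu": 75,                # number of layers offloaded to the GPU
    "stop": ["--end-of-sql--"],   # generation stops at this marker
}

# Assumption: a local Ollama server on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_payload(prompt, model):
    """Return the JSON body for a non-streaming generate request."""
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        "options": dict(PARAMS),  # copy so callers can tweak safely
    }


def generate(prompt, model, url=OLLAMA_URL):
    """POST the payload and return the generated text (needs a running server)."""
    data = json.dumps(build_payload(prompt, model)).encode("utf-8")
    request = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


# Example call, assuming the model was pulled under the hypothetical tag below:
# print(generate("List all customers who ordered in 2024.", "xiyansql-32b"))
```

Since the parameters are already stored with the model, passing them in `options` should only be needed to override them for a single request.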