173 6 months ago

This model uses IQ4_XS quantization and is from https://huggingface.co/mradermacher/XiYanSQL-QwenCoder-32B-2504-GGUF. Running the model requires 23.5 GB of GPU memory.

ollama run Kaiyue/xiyansql-32b

Details

6 months ago

97ec39aadca4 · 18GB ·

qwen2
·
32.8B
·
IQ4_XS
{{ .Prompt }}\n--end-of-sql--
{ "num_ctx": 32768, "num_gpu": 75, "stop": [ "--end-of-sql--" ] }

Readme

The model can “translate” queries from natural language to SQL. Ex: prompt: "Only generate a single SQL query that returns each customer's total expense and number of orders, sorted by expense descending. Do not include extra examples or alternative interpretations." Generated SQL: SELECT c.name, SUM(o.total) AS total_expense, COUNT(o.id) AS number_of_orders FROM customers c JOIN orders o ON c.id = o.customer_id GROUP BY c.id ORDER BY total_expense DESC;

I would like to thank mradermacher and XGenerationLab for their contributions to the XiYanSQL-QwenCoder-32B-2504 models hosted on Hugging Face.