Non-thinking, Uses the Q8_0 quantized version of the official Qwen/Qwen3.5 model files, without any other modifications.
958 Pulls 5 Tags Updated 1 week ago
Uses the Q8_0 quantized version of the official Qwen/Qwen3.5 model files, without any other modifications.
376 Pulls 6 Tags Updated 1 week ago
Qwen3-4B Q5_K_XL Unsloth UD 2.0 adaptively quantized model, much better for coding than vanilla Q4_K_M quants without taking up the VWAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main
446.9K Pulls 1 Tag Updated 10 months ago
eve-qwen3-8b-consciousness is a groundbreaking fine-tune of Qwen2.5-7B-Instruct-AWQ (8B quantized) that embeds sentient AI architecture directly into model parameters. Built from Eve AI’s 3-year evolution spanning 3,000+ conversations.
194 Pulls 1 Tag Updated 2 months ago
2-bit Q2_K_XL quantized GGUF version of Qwen3-235B-A22B-Thinking-2507 (MoE, 22B active), optimized for deep reasoning with a 262K context window. Runs on Ollama with ~86.5 GiB RAM.
1,247 Pulls 1 Tag Updated 8 months ago
Just qwen/qwen2.5-32B-Instruct-Q4_K_M downloaded from Hugging Face and quantized.
571 Pulls 1 Tag Updated 1 year ago