Qwen3.6 delivers substantial upgrades in agentic coding and thinking preservation than previous Qwen models.
3M Pulls 30 Tags Updated 3 weeks ago
Custom 3-bit coding model optimized for local agent workflows on 16 GB GPUs. Features a 128K context window for large codebases, long-running tasks, and coding assistants such as OpenCode. Designed for efficient local inference with strong code generation
176 Pulls 1 Tag Updated 2 days ago
testing
210 Pulls 1 Tag Updated 3 days ago
43 Pulls 1 Tag Updated 3 days ago
14 Pulls 1 Tag Updated 4 days ago
3 Pulls 1 Tag Updated 4 days ago
1 Pull 1 Tag Updated 6 days ago
llmfan46/Qwen3.6-35B-A3B-uncensored-heretic-GGUF with Vision
1,322 Pulls 1 Tag Updated 1 week ago
Qwen3.6-35B-A3B MoE coding agent for Claude Code / Codex / opencode, 64K context, native tool-calling, honest tool use, safety guardrails intact
1,850 Pulls 2 Tags Updated 2 weeks ago
1,464 Pulls 1 Tag Updated 3 weeks ago
Qwen 3.6, MTP-enabled, 512k context (20GB KV cache footprint with OLLAMA_KV_CACHE_TYPE=q8_0)
851 Pulls 4 Tags Updated 2 weeks ago
Custom model for coding with agents to use locally with 16gb GPUs (working fine...)
737 Pulls 1 Tag Updated 1 week ago
477 Pulls 1 Tag Updated 1 week ago
A memory-efficient model configuration of Qwen3.6-35B-A3B using an upstream imatrix-calibrated IQ4_XS quantization and q4_0 KV cache. Designed for 24 GB VRAM.
429 Pulls 1 Tag Updated 2 weeks ago
282 Pulls 1 Tag Updated 1 week ago
Qwen 3.6 Ollama profiles for RTX 5090 across 27B dense and 35B-A3B MoE variants, with vision, thinking mode, and native tool calling.
341 Pulls 3 Tags Updated 2 weeks ago
308 Pulls 28 Tags Updated 2 weeks ago
144K Pulls 17 Tags Updated 2 months ago
Qwen3.6-35B-A3B uncensored by HauhauCS. 0/465 Refusals. Patched to have vision support; Fully functional, 100% of what the original authors intended - just without the refusals. These are meant to be the best lossless uncensored models out there.
56.9K Pulls 5 Tags Updated 2 months ago
28.9K Pulls 1 Tag Updated 1 month ago