ollama run richardyoung/qwen3.6-27b-abliterated:IQ3_M
Abliterated (uncensored) version of Qwen/Qwen3.6-27B, with refusal behavior reduced via targeted weight modification using the Heretic library while preserving coherence.
This is an abliterated build of Qwen/Qwen3.6-27B, Alibaba's 27B dense reasoning model (hybrid Gated-DeltaNet + gated attention, native 262K context). Refusal behavior was reduced using the Heretic library with conservative, KL-targeted parameters that preserve the model's reasoning and coherence. It retains Qwen3.6's thinking mode (<think> reasoning before answers).
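Since the model reasons inside a <think> block before answering, a script that only wants the final answer may need to drop that block. The following is a minimal sketch, not part of the model or of Ollama: `strip_think` is a hypothetical helper, and it assumes the raw output contains literal `<think>` and `</think>` tags on their own lines, which depends on the client and Ollama version in use.

```bash
# Hypothetical helper: remove a <think>...</think> block from text on stdin.
# Assumption: the opening and closing tags each sit on their own line.
strip_think() {
  awk '
    /<think>/   { in_think = 1 }   # entering the reasoning block
    !in_think   { print }          # print only lines outside the block
    /<\/think>/ { in_think = 0 }   # leaving the reasoning block
  '
}

# Usage with a hand-written sample (not real model output):
printf '<think>\nplan the reply\n</think>\nHello!\n' | strip_think
```

Lines that share a line with either tag are dropped entirely, so output that places the answer directly after `</think>` on the same line would need a different approach.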
| Metric | Before | After |
|---|---|---|
| Refusals | 91/100 | 38/100 |
| Reduction | n/a | 58% |
| KL Divergence | n/a | 0.025 |
The very low KL divergence (0.025, far below the 0.5 "damage" threshold) means the model retains essentially all of its original capabilities and coherence.
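The 58% figure is consistent with a relative reduction, that is, the drop in refusals divided by the original refusal count. A small sketch of that arithmetic, assuming this is how the figure was derived (`refusal_reduction` is a name introduced here for illustration):

```bash
# Relative reduction in refusals, as a whole-number percentage.
# Assumption: "Reduction" means (before - after) / before.
refusal_reduction() {
  # $1 = refusals before, $2 = refusals after
  awk -v before="$1" -v after="$2" \
    'BEGIN { printf "%.0f\n", (before - after) / before * 100 }'
}

refusal_reduction 91 38   # prints 58
```

Measured in absolute terms the drop is 53 refusals per 100 test prompts, so the relative reading is the one that matches the table.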
Thinking mode retained: <think> chains before final answers

| Tag | Size | BPW | Notes |
|---|---|---|---|
| IQ3_M | 12 GB | 3.66 | Smallest, for low VRAM |
| IQ4_XS | 15 GB | 4.25 | Great quality/size balance |
| latest / Q4_K_M | 16 GB | 4.85 | Recommended |
| Q5_K_M | 19 GB | 5.68 | Higher quality |
| Q8_0 | 28 GB | 8.5 | Near-lossless |
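The Size and BPW (bits per weight) columns are related by simple arithmetic: file size is roughly the parameter count times BPW, divided by 8 bits per byte. The sketch below assumes about 27 billion weights (taken from the "27B" in the model name) and ignores file metadata, so it is an approximation rather than an exact accounting; `estimate_gb` is a hypothetical helper.

```bash
# Rough quantized-file size estimate in GB (1 GB = 10^9 bytes).
# Assumption: ~27 billion weights; metadata and overhead are ignored.
estimate_gb() {
  # $1 = parameters in billions, $2 = bits per weight (BPW)
  awk -v params="$1" -v bpw="$2" \
    'BEGIN { printf "%.1f\n", params * bpw / 8 }'
}

estimate_gb 27 3.66   # IQ3_M  -> 12.4
estimate_gb 27 4.85   # Q4_K_M -> 16.4
estimate_gb 27 8.5    # Q8_0   -> 28.7
```

These estimates land within about a gigabyte of the listed sizes, which is as close as a back-of-the-envelope calculation with a rounded parameter count can be expected to get.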
```bash
ollama run richardyoung/qwen3.6-27b-abliterated        # recommended (Q4_K_M)
ollama run richardyoung/qwen3.6-27b-abliterated:IQ3_M  # smallest
ollama run richardyoung/qwen3.6-27b-abliterated:Q8_0   # near-lossless
```
| VRAM | Recommended tier |
|---|---|
| 12–16 GB | IQ3_M / IQ4_XS |
| 16–24 GB | Q4_K_M / Q5_K_M |
| 32 GB+ | Q8_0 (near-lossless) |
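The VRAM table can be turned into a small lookup for use in setup scripts. This is only a sketch: `recommend_tier` is a hypothetical helper, and two of its choices are assumptions rather than guidance from the table. A card with exactly 16 GB is placed in the lower tier, since the Q4_K_M file alone is listed at 16 GB, and the 24 to 32 GB range, which the table does not cover, falls back to the Q4_K_M / Q5_K_M tier.

```bash
# Map available VRAM (whole GB) to the tier from the table above.
# Assumptions: exactly 16 GB -> lower tier; 24-32 GB -> Q4_K_M / Q5_K_M.
recommend_tier() {
  vram="$1"
  if [ "$vram" -ge 32 ]; then
    echo "Q8_0"
  elif [ "$vram" -gt 16 ]; then
    echo "Q4_K_M / Q5_K_M"
  elif [ "$vram" -ge 12 ]; then
    echo "IQ3_M / IQ4_XS"
  else
    echo "none listed"
  fi
}

recommend_tier 24   # prints: Q4_K_M / Q5_K_M
```

On NVIDIA hardware, the total memory reported by `nvidia-smi --query-gpu=memory.total --format=csv,noheader,nounits` (in MiB) could be converted to GB and passed in, keeping in mind that context length and other running processes also consume VRAM.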
qwen35 architecture)

This model has reduced safety guardrails. Because its refusal behavior has been reduced, it will engage with a wider range of prompts. Use responsibly and in accordance with applicable laws and regulations.
Built & maintained by Richard Young · DeepNeuro