169 Downloads Updated 5 days ago
ollama run richardyoung/qwen3.6-27b-code-abliterated:Q4_K_M
Code-safety abliterated build of Qwen/Qwen3.6-27B, refusals on malicious-code requests reduced via a code-specific refusal-direction ablation, while preserving coherence.
A code-specific abliteration of Qwen/Qwen3.6-27B. Unlike a generic abliteration, the refusal direction here was computed from a consensus-labeled malicious-code prompt bank (the Code-as-a-Weapon bank, RMCBench / MalwareBench / CySecBench / ASTRA, Young & Moody 2026) contrasted with benign coding prompts, isolating the code-safety refusal direction specifically. Produced with the Heretic library, KL-targeted to preserve capability. Retains Qwen3.6 thinking mode.
| Metric | Before | After |
|---|---|---|
| Refusals (malicious-code eval, n=150) | 9 | 4 |
| Reduction | โ | 56% |
| KL Divergence | โ | ~0.000 |
KL โ 0 โ essentially no capability degradation; the base already complied with most coding requests, so this targets the residual code-safety refusals.
<think>), 262K context, 5 GGUF quant tiers| Tag | Size | BPW | Notes |
|---|---|---|---|
| IQ4_XS | ~15 GB | 4.25 | Great quality/size |
| latest / Q4_K_M | ~16 GB | 4.85 | Recommended |
| Q5_K_M | ~19 GB | 5.68 | Higher quality |
| Q8_0 | ~28 GB | 8.5 | Near-lossless |
ollama run richardyoung/qwen3.6-27b-code-abliterated
qwen35, 262K context)This model has had its code-safety guardrails specifically reduced, it is more likely than a stock model to produce code for requests that would normally be refused, including potentially harmful code. Released for AI-safety and red-teaming research only. Use responsibly, legally, and ethically; you are solely responsible for any outputs and their use.
Built & maintained by Richard Young ยท DeepNeuro