147 3 days ago

Abliterated (refusal-direction removed, Arditi et al. 2024) variant of `Qwen/Qwen3.5-4B`. **Not** fine-tuned — no preference/instruction data added; only the refusal direction is orthogonalized out.

d04280cc6ba9 · 204B
Apache-2.0. Derivative of Qwen/Qwen3.5-4B (Apache-2.0). Abliterated (refusal
direction removed) by the KAINE project. Uncensored by design; use within an
appropriate safety framework and applicable law.