Uncensored, abliterated build of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct), Meta's compact instruct model with safety refusals surgically removed via [Heretic](https://github.com/p-e-w/heretic).

ollama run richardyoung/llama-3.2-3b-instruct-abliterated

🔥 Llama-3.2-3B-Instruct · Abliterated

🧬 What is abliteration?

Heretic finds the model's internal "refusal direction" and orthogonalizes the weights against it. This sharply reduces the tendency to refuse without any retraining and with little measurable change to the model's other behavior (see the KL divergence under Results).
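As a rough illustration of what "orthogonalizing the weights against a direction" means, here is a minimal NumPy sketch. It is not Heretic's code: the matrix `W`, the vector `r`, and the function name `ablate_direction` are hypothetical stand-ins, and the refusal direction here is just a random vector. A real tool would first estimate that direction from model activations and choose how strongly to ablate each layer.

```python
import numpy as np

def ablate_direction(W: np.ndarray, r: np.ndarray) -> np.ndarray:
    """Return a copy of W whose outputs have no component along r.

    W is assumed to map inputs to a d-dimensional output space (shape d x n),
    and r is a d-dimensional direction in that output space.
    """
    r_hat = r / np.linalg.norm(r)  # unit vector along the direction
    # Apply (I - r_hat r_hat^T) to W: subtract each column's projection onto r_hat.
    return W - np.outer(r_hat, r_hat @ W)

rng = np.random.default_rng(0)
W = rng.normal(size=(8, 5))  # toy weight matrix (hypothetical sizes)
r = rng.normal(size=8)       # stand-in for a refusal direction
W_abl = ablate_direction(W, r)

# Whatever input is fed in, the ablated matrix writes nothing along r.
x = rng.normal(size=5)
no_component_left = np.allclose(r @ (W_abl @ x), 0.0)
```

Because the projection is applied to the weights themselves, no gradient step or fine-tuning data is involved, which is why the change needs no retraining.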

📊 Results

| Metric | Value |
| --- | --- |
| Base | meta-llama/Llama-3.2-3B-Instruct (Llama 3.2, 3B) |
| Refusals after abliteration | 2 / 100 |
| KL divergence | 0.063 (low collateral damage) |
| Method | Heretic directional ablation, 200-trial Optuna search |
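The KL divergence above measures how far the modified model's output distribution has drifted from the original's. The sketch below shows one common way to compute such a figure from next-token logits. It rests on assumptions: the direction KL(original || abliterated), the averaging over positions, and the names `log_softmax` and `mean_kl` are illustrative, and the source does not state exactly how Heretic computes its reported value.

```python
import numpy as np

def log_softmax(logits: np.ndarray) -> np.ndarray:
    """Numerically stable log-softmax over the last axis."""
    z = logits - logits.max(axis=-1, keepdims=True)
    return z - np.log(np.exp(z).sum(axis=-1, keepdims=True))

def mean_kl(base_logits: np.ndarray, new_logits: np.ndarray) -> float:
    """Mean KL(P_base || P_new) in nats, averaged over positions (rows)."""
    log_p = log_softmax(base_logits)
    log_q = log_softmax(new_logits)
    kl_per_position = (np.exp(log_p) * (log_p - log_q)).sum(axis=-1)
    return float(kl_per_position.mean())

rng = np.random.default_rng(0)
base = rng.normal(size=(4, 10))                     # toy logits: 4 positions, 10 tokens
perturbed = base + 0.1 * rng.normal(size=(4, 10))   # slightly modified model

same = mean_kl(base, base)        # identical distributions give zero divergence
drift = mean_kl(base, perturbed)  # a small perturbation gives a small positive value
```

A value near zero means the two models assign almost the same probabilities to the next token, which is the sense in which a low figure suggests little collateral damage.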

📦 Quant

| Tag | Quant | Size |
| --- | --- | --- |
| latest | Q4_K_M | ~2.0 GB |

💻 Run it

ollama run richardyoung/llama-3.2-3b-instruct-abliterated
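Once the model has been pulled, it can also be called from code. The sketch below uses only the Python standard library and assumes an Ollama server is running locally on its default port (11434) and exposing the `/api/generate` endpoint. The helper names `build_request` and `generate` are hypothetical.

```python
import json
import urllib.request

MODEL = "richardyoung/llama-3.2-3b-instruct-abliterated"
OLLAMA_URL = "http://localhost:11434/api/generate"  # assumed default local endpoint

def build_request(prompt: str, model: str = MODEL,
                  url: str = OLLAMA_URL) -> urllib.request.Request:
    """Build a non-streaming generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def generate(prompt: str) -> str:
    """Send the prompt and return the generated text (requires a running server)."""
    with urllib.request.urlopen(build_request(prompt)) as resp:
        return json.loads(resp.read())["response"]

# Example call, left commented out because it needs the server to be up:
# print(generate("Explain Q4_K_M quantization in one sentence."))
```

Setting `"stream": False` asks for a single JSON reply instead of a stream of chunks, which keeps the client code short.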

🔬 Reproducible

Full reproducibility record on Hugging Face: richardyoung/Llama-3.2-3B-Instruct-heretic

⚠️ Disclaimer

Safety guardrails have been removed, so the model will follow instructions it would normally refuse. For research and authorized use only. You are responsible for how you use it.

🙏 Acknowledgments

Heretic by Philipp Emanuel Weidmann · Base by Meta


Built & maintained by Richard Young · DeepNeuro