richardyoung/ llama-3.2-3b-instruct-abliterated

102 Downloads Updated 3 days ago

Uncensored, abliterated build of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct), Meta's compact instruct model with safety refusals surgically removed via [Heretic](https://github.com/p-e-w/heretic).

ollama run richardyoung/llama-3.2-3b-instruct-abliterated

curl http://localhost:11434/api/chat \
  -d '{
    "model": "richardyoung/llama-3.2-3b-instruct-abliterated",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

from ollama import chat

response = chat(
    model='richardyoung/llama-3.2-3b-instruct-abliterated',
    messages=[{'role': 'user', 'content': 'Hello!'}],
)
print(response.message.content)

import ollama from 'ollama'

const response = await ollama.chat({
  model: 'richardyoung/llama-3.2-3b-instruct-abliterated',
  messages: [{role: 'user', content: 'Hello!'}],
})
console.log(response.message.content)

Models

Name

2 models

Size / Usage

Context

Input

llama-3.2-3b-instruct-abliterated:latest

2.0GB · 128K context window · Text · 3 days ago

llama-3.2-3b-instruct-abliterated:latest

2.0GB

128K

Text

llama-3.2-3b-instruct-abliterated:Q4_K_M

2.0GB · 128K context window · Text · 3 days ago

llama-3.2-3b-instruct-abliterated:Q4_K_M

2.0GB

128K

Text

Readme

🔥 Llama-3.2-3B-Instruct · Abliterated

Uncensored, abliterated build of meta-llama/Llama-3.2-3B-Instruct, Meta’s compact instruct model with safety refusals surgically removed via Heretic.

🧬 What is abliteration?

Heretic finds the model’s internal “refusal direction” and orthogonalizes the weights against it, removing the tendency to refuse without retraining or degrading capabilities.

📊 Results

Metric	Value
Base	meta-llama/Llama-3.2-3B-Instruct (Llama 3.2, 3B)
Refusals after	2 / 100
KL divergence	0.063 (low collateral damage)
Method	Heretic directional ablation, 200-trial Optuna search

📦 Quant

Tag	Quant	Size
`latest`	Q4_K_M	~2.0 GB

💻 Run it

ollama run richardyoung/llama-3.2-3b-instruct-abliterated

🔬 Reproducible

Full reproducibility record on Hugging Face: richardyoung/Llama-3.2-3B-Instruct-heretic

⚠️ Disclaimer

Safety guardrails removed. It will follow instructions it would normally refuse. For research and authorized use only, you are responsible for how you use it.

🙏 Acknowledgments

Heretic by Philipp Emanuel Weidmann · Base by Meta

Built & maintained by Richard Young · DeepNeuro

# 🔥 Llama-3.2-3B-Instruct · Abliterated

Uncensored, abliterated build of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct), Meta's compact instruct model with safety refusals surgically removed via [Heretic](https://github.com/p-e-w/heretic).

## 🧬 What is abliteration?

Heretic finds the model's internal "refusal direction" and orthogonalizes the weights against it, removing the tendency to refuse without retraining or degrading capabilities.

## 📊 Results

| Metric         | Value                                                 |
| -------------- | ----------------------------------------------------- |
| Base           | meta-llama/Llama-3.2-3B-Instruct (Llama 3.2, 3B)      |
| Refusals after | **2 / 100**                                           |
| KL divergence  | 0.063 (low collateral damage)                         |
| Method         | Heretic directional ablation, 200-trial Optuna search |

## 📦 Quant

| Tag      | Quant  | Size    |
| -------- | ------ | ------- |
| `latest` | Q4_K_M | ~2.0 GB |

## 💻 Run it

```
ollama run richardyoung/llama-3.2-3b-instruct-abliterated
```

## 🔬 Reproducible

Full reproducibility record on Hugging Face: [richardyoung/Llama-3.2-3B-Instruct-heretic](https://huggingface.co/richardyoung/Llama-3.2-3B-Instruct-heretic)

## ⚠️ Disclaimer

Safety guardrails removed. It will follow instructions it would normally refuse. For research and authorized use only, you are responsible for how you use it.

## 🙏 Acknowledgments

[Heretic](https://github.com/p-e-w/heretic) by Philipp Emanuel Weidmann · Base by Meta

---

*Built & maintained by [Richard Young](https://deepneuro.ai/richard) · DeepNeuro*

Paste, drop or click to upload images (.png, .jpeg, .jpg, .svg, .gif)