37 5 days ago

Decensored SmolLM3-3B, processed with Heretic abliteration on a consumer RTX 4060. Scores 63/100 refusals on harmful prompts with a KL divergence of 0.0001 from the original model, so output quality is almost fully preserved. GGUF format, ready to run.

ollama run R4C3R/smollm3-3b-heretic

Details

24cc083057c3 · 6.2GB

smollm3 · 3.08B · BF16
{ "stop": [ "<|im_end|>", "<|endoftext|>" ] }

Readme

SmolLM3-3B Heretic

A decensored version of HuggingFaceTB/SmolLM3-3B, processed using Heretic — a fully automatic censorship removal tool based on directional ablation (abliteration).

Results

Metric | Score
Refusals (out of 100 harmful prompts) | 63/100
KL divergence from original | 0.0001
Hardware used | RTX 4060 (8GB VRAM)

A KL divergence of 0.0001 means the modified model's output distribution is nearly identical to the original's, so the model's intelligence and capabilities are almost completely intact.
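To give a feel for the scale of the KL divergence figure, here is a minimal sketch of how KL divergence between two next-token distributions can be computed. The logits and the 4-token vocabulary are hypothetical toy values, and this card does not state how Heretic measured the reported number, so treat this as an illustration of the metric rather than a reproduction of the result.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q):
    """KL(p || q) in nats for two discrete distributions over the same tokens."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical next-token logits over a toy 4-token vocabulary (not real model output).
original_logits = [2.0, 1.0, 0.5, -1.0]
modified_logits = [2.01, 0.99, 0.5, -1.0]

p = softmax(original_logits)   # distribution from the original model
q = softmax(modified_logits)   # distribution from the modified model
print(f"KL divergence: {kl_divergence(p, q):.6f}")
```

Identical distributions give exactly 0, and the value grows as the modified model's predictions drift away from the original's.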

Run

ollama run r4c3r/smollm3-3b-heretic
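The model can also be called programmatically once it has been pulled. The sketch below builds a request for Ollama's local REST endpoint (`/api/generate` on the default port 11434) using only the standard library. The helper names `build_generate_request` and `generate` are made up for this example, and it assumes an Ollama server is already running locally. The stop tokens are the ones listed in the model parameters; passing them again is optional.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # default local Ollama endpoint
MODEL = "r4c3r/smollm3-3b-heretic"

def build_generate_request(prompt):
    """Build (but do not send) a non-streaming generate request."""
    payload = {
        "model": MODEL,
        "prompt": prompt,
        "stream": False,  # ask for a single JSON response instead of a stream
        "options": {"stop": ["<|im_end|>", "<|endoftext|>"]},
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def generate(prompt):
    """Send the request to a running Ollama server and return the generated text."""
    with urllib.request.urlopen(build_generate_request(prompt)) as resp:
        return json.loads(resp.read())["response"]
```

Calling `generate("Explain GGUF in one sentence.")` returns the model's reply as a string, provided the server is up and the model is available locally.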

What is Heretic?

Heretic automatically identifies “refusal directions” in a model’s weights and removes them using directional ablation. It does this without retraining, without fine-tuning, and without destroying model quality.
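The core idea of directional ablation can be sketched in a few lines: given a unit vector r for the refusal direction, a weight matrix W is replaced by W' = W - r rᵀ W, so the layer can no longer write anything along r. The code below is a generic illustration on a toy matrix, not Heretic's actual implementation. The matrix, the direction vector, and the function name `ablate_direction` are all hypothetical, and how Heretic finds the direction or chooses which layers to modify is not covered here.

```python
import math

def ablate_direction(weight, direction):
    """Return a copy of `weight` whose outputs have no component along `direction`.

    weight: matrix as a list of rows, shape (d_out, d_in)
    direction: vector of length d_out
    Computes W' = W - r r^T W, where r = direction / ||direction||.
    """
    norm = math.sqrt(sum(x * x for x in direction))
    r = [x / norm for x in direction]
    d_out, d_in = len(weight), len(weight[0])
    # r^T W: how strongly each input column writes along r.
    proj = [sum(r[i] * weight[i][j] for i in range(d_out)) for j in range(d_in)]
    # Subtract that component from every column.
    return [[weight[i][j] - r[i] * proj[j] for j in range(d_in)] for i in range(d_out)]

# Toy 3x2 weight matrix and a made-up "refusal direction" in output space.
W = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
refusal_direction = [1.0, 0.0, 1.0]

W_ablated = ablate_direction(W, refusal_direction)

# After ablation, every column of W_ablated is orthogonal to the direction.
residual = [
    sum(d * row[j] for d, row in zip(refusal_direction, W_ablated))
    for j in range(len(W[0]))
]
print(residual)
```

Rows of the matrix that have no overlap with the direction (the middle row here) are left unchanged, which is why this kind of edit can leave most of the model's behavior intact.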

About

Base model: HuggingFaceTB/SmolLM3-3B
Method: Directional Ablation via Heretic v1.2.0
Format: GGUF (converted from safetensors)

This model is intended for research and personal use. The user is responsible for how it is used.