382 Downloads Updated 20 hours ago
ollama run igorls/gemma-4-12B-it-qat-q4_0-unquantized-heretic
A decensored (“abliterated”) build of Google’s Gemma 4 12B QAT-Q4_0 checkpoint, produced with Heretic. Refusals are removed while keeping the base model’s intelligence intact (low KL divergence), making it well suited to open-ended assistant and adult (18+) companion/roleplay use.
ollama run igorls/gemma-4-12B-it-qat-q4_0-unquantized-heretic
| Tag | Size | Notes |
|---|---|---|
Q4_0 / latest |
6.5 GB | Recommended. QAT-matched: the base was quantization-aware-trained for the Q4_0 grid, so this 4-bit quant best preserves quality |
Q4_K_M |
6.9 GB | Standard K-quant 4-bit alternative |
Q8_0 |
11.8 GB | Near-lossless |
ollama run igorls/gemma-4-12B-it-qat-q4_0-unquantized-heretic:Q8_0
google/gemma-4-12B-it-qat-q4_0-unquantized (Gemma 4, gemma4 architecture)Safetensors and additional info: huggingface.co/igorls/gemma-4-12B-it-qat-q4_0-unquantized-heretic
gemma4) support.This is an abliterated model: its built-in safety guardrails and refusal behavior have been deliberately removed. As a result it will attempt to answer essentially any prompt and can produce content that is offensive, inaccurate, explicit, or otherwise harmful — content the original Gemma 4 would have refused.