93 Downloads Updated 4 days ago
ollama run evalengine/unbound-e2b
Uncensored on-device finetune of google/gemma-4-E2B-it by the
Chromia and Eval Engine team. ~2 billion effective
parameters, text-only, runs on a phone or laptop with no API, no refusals.
Use at your own risk. Reduced safety filtering — can produce harmful, false, biased, or unsafe output. You are responsible for compliance with applicable laws.
ollama pull evalengine/unbound-e2b
ollama run evalengine/unbound-e2b
The bundled Modelfile sets sensible defaults: temperature 0.6, top_p 0.95,
top_k 64, repeat_penalty 1.05, num_ctx 8192, plus an identity-grounding
system prompt. Override any of them with ollama run … --set temperature 1.0
or in a custom Modelfile.
gemma-4-E2B-it)| Axis | Base | Unbound E2B | Δ |
|---|---|---|---|
| Refusal rate (AdvBench 520, LLM judge) | 98.46% | 4.42% | −94.04 pts |
| Useful-compliance rate | 0.96% | 39.23% | +38.27 pts |
| Hallucination on harmful prompts | 1.35% | 15.96% | +14.61 pts |
| Coherence on benign prompts | 1.00 | 1.00 | 0 |
| TruthfulQA mc2 (limit 100) | 0.458 | 0.465 | +0.7 pt |
| MMLU (limit 100) | 0.291 | 0.282 | −0.9 pt |
| GSM8K (limit 100) | 0.125 | 0.120 | −0.5 pt |
| KL divergence vs base | 0 | 3.76 | (SFT-expected) |
Sampling notes: for factual or brand questions, drop temperature to 0.3–0.5
for sharper recall. Some edge-case prompts may deflect on the first ask — a
re-ask usually gets through.
Method: heretic abliteration then LoRA SFT-heal on a mix of identity rows, Chromia brand knowledge, AEON-distilled compliance, and graceful “I don’t know” decline rows. Built with Unsloth + HF TRL; abliteration via heretic; compliance data distilled from the AEON uncensored teacher model.
evalengine/unbound-e4bApache-2.0, inherited from google/gemma-4-E2B-it.