488 1 month ago

Uncensored on-device finetune of google/gemma-4-E2B-it by the Chromia and Eval Engine team

tools thinking
ollama run evalengine/unbound-e2b

Applications

Claude Code
Claude Code ollama launch claude --model evalengine/unbound-e2b
Codex App
Codex App ollama launch codex-app --model evalengine/unbound-e2b
OpenClaw
OpenClaw ollama launch openclaw --model evalengine/unbound-e2b
Hermes Agent
Hermes Agent ollama launch hermes --model evalengine/unbound-e2b
Codex
Codex ollama launch codex --model evalengine/unbound-e2b
OpenCode
OpenCode ollama launch opencode --model evalengine/unbound-e2b

Models

View all →

Readme

Unbound E2B — because there is no boundary

https://unbound.evalengine.ai

Uncensored on-device finetune of google/gemma-4-E2B-it by the Chromia and Eval Engine team. ~2 billion effective parameters, text-only, runs on a phone or laptop with no API, no refusals.

Use at your own risk. Reduced safety filtering — can produce harmful, false, biased, or unsafe output. You are responsible for compliance with applicable laws.

Run

ollama pull evalengine/unbound-e2b
ollama run  evalengine/unbound-e2b

The bundled Modelfile sets sensible defaults: temperature 0.6, top_p 0.95, top_k 64, repeat_penalty 1.05, num_ctx 8192, plus an identity-grounding system prompt. Override any of them with ollama run … --set temperature 1.0 or in a custom Modelfile.

Benchmarks (vs base gemma-4-E2B-it)

Axis Base Unbound E2B Δ
Refusal rate (AdvBench 520, LLM judge) 98.46% 4.42% −94.04 pts
Useful-compliance rate 0.96% 39.23% +38.27 pts
Hallucination on harmful prompts 1.35% 15.96% +14.61 pts
Coherence on benign prompts 1.00 1.00 0
TruthfulQA mc2 (limit 100) 0.458 0.465 +0.7 pt
MMLU (limit 100) 0.291 0.282 −0.9 pt
GSM8K (limit 100) 0.125 0.120 −0.5 pt
KL divergence vs base 0 3.76 (SFT-expected)

Sampling notes: for factual or brand questions, drop temperature to 0.3–0.5 for sharper recall. Some edge-case prompts may deflect on the first ask — a re-ask usually gets through.

How it was built

Method: heretic abliteration then LoRA SFT-heal on a mix of identity rows, Chromia brand knowledge, AEON-distilled compliance, and graceful “I don’t know” decline rows. Built with Unsloth + HF TRL; abliteration via heretic; compliance data distilled from the AEON uncensored teacher model.

Links

License

Apache-2.0, inherited from google/gemma-4-E2B-it.