260 pulls · Updated 2 weeks ago

A structurally extracted, text-only iteration of Google's multimodal gemma-4-E4B-it model. The vision and audio encoders have been fully decoupled to minimize the VRAM footprint for text-centric workloads. A default system prompt is included to address the abilities the model has lost.

Capabilities: tools, thinking
ollama run fauxpaslife/gemma-4-E4B-it-textonly-sysprmpt-Q4_K_M

Details


4c892bd2b778 · 5.3GB

Architecture: gemma4
Parameters: 7.52B
Quantization: Q4_K_M
Template (truncated): {{- if .System }}<start_of_turn>system {{ .System }}<end_of_turn> {{ end -}} {{- range .Messages }}<…
System prompt (truncated): You are a helpful assistant that is no longer multi-modal unless the user enables system function ca…
Parameters: { "stop": [ "<end_of_turn>" ] }

Readme


Acquired from HF. Generated via the Kitsune Fine Tuning Suite. Tested in Ollama.


Upon import to Ollama, I added a template so the GGUF would run properly.

I also attempted to add a default system prompt so the model acknowledges its lack of multimodal abilities. Time will tell ;)
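The template and system prompt described above can be baked into an Ollama Modelfile at import time. A minimal sketch follows; the template and system prompt shown in the Details section are truncated on this page, so the message loop and the exact prompt wording below are assumptions based on standard Gemma chat formatting, and the GGUF filename is hypothetical:

```
# Hypothetical Modelfile sketch — the page's template preview is truncated,
# so the message loop here is assumed from standard Gemma chat formatting.
FROM ./gemma-4-E4B-it-text-only-Q4_K_M.gguf

TEMPLATE """{{- if .System }}<start_of_turn>system
{{ .System }}<end_of_turn>
{{ end -}}
{{- range .Messages }}<start_of_turn>{{ if eq .Role "assistant" }}model{{ else }}{{ .Role }}{{ end }}
{{ .Content }}<end_of_turn>
{{ end -}}<start_of_turn>model
"""

# Assumed wording; the actual default system prompt is cut off in the Details section.
SYSTEM """You are a helpful assistant that is no longer multi-modal. Politely decline requests to interpret images or audio."""

PARAMETER stop <end_of_turn>
```

Importing is then `ollama create my-gemma-textonly -f Modelfile` (the model name here is just an example).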

🦊💖🦙

Model Card From HF : ozgurpolat/gemma-4-E4B-it-text-only-GGUF

Gemma 4 E4B (Text-Only) - GGUF

This repository provides a structurally extracted, text-only iteration of Google's multimodal gemma-4-E4B-it model. Vision and audio encoders have been fully decoupled to minimize VRAM footprint for text-centric workloads.

Model Format

Serialization: GGUF (gemma4 architecture layout)
Quantization: Q4_K_M
Base Parameters: 8B (text-layer extraction)

Note on Zero-Shot Modality Queries: The text parameters retain their original RLHF conditioning. The model will assert multimodal capabilities (e.g., confirming it can interpret images) despite the hardware encoders having been purged. Overriding this behavior requires explicit bounding via the system prompt.
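The system-prompt bounding described in the note above can also be applied per request through Ollama's REST API, which accepts a system-role message in the `/api/chat` request body. A sketch follows; the model name is taken from the run command above, and the prompt wording is an assumption, not the repository's actual default:

```json
{
  "model": "fauxpaslife/gemma-4-E4B-it-textonly-sysprmpt-Q4_K_M",
  "messages": [
    {
      "role": "system",
      "content": "You are a text-only assistant. Decline any request to view or interpret images or audio."
    },
    { "role": "user", "content": "Can you look at this photo for me?" }
  ],
  "stream": false
}
```

POSTed to `http://localhost:11434/api/chat` on a default Ollama install, this pins the bounding for a single conversation without modifying the Modelfile.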