Gemma 4 Turbo e2b is the compact edition of the Gemma 4 Turbo family, optimized for machines with limited RAM (8 GB or more). It is tuned for maximum tokens per second on CPU through int4 quantization, KV cache quantization (Stage 1 TurboQuant), and TurboQuant inference tuning. It is well suited to lightweight chat, tool calling, and edge deployments.
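
For readers unfamiliar with int4 quantization, the following is a minimal, generic sketch of the idea: floats are mapped to 4-bit integers with a shared scale, and two codes are packed per byte. It is an illustration only and is not the TurboQuant implementation; the function names `quantize_int4`, `dequantize_int4`, and `pack_int4` are hypothetical, and real schemes typically use per-group scales and more careful rounding.

```python
# Illustrative sketch of symmetric int4 quantization.
# Assumption: one shared scale per list; this is NOT the TurboQuant method.
from typing import List, Tuple


def quantize_int4(values: List[float]) -> Tuple[List[int], float]:
    """Map floats to integer codes in [-8, 7] using one shared scale."""
    max_abs = max((abs(v) for v in values), default=0.0)
    # Scale so the largest magnitude maps to 7; fall back to 1.0 for all-zero input.
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    codes = [max(-8, min(7, round(v / scale))) for v in values]
    return codes, scale


def dequantize_int4(codes: List[int], scale: float) -> List[float]:
    """Recover approximate floats from integer codes."""
    return [c * scale for c in codes]


def pack_int4(codes: List[int]) -> bytes:
    """Pack two 4-bit codes per byte, low nibble first (pads odd lengths with 0)."""
    if len(codes) % 2:
        codes = codes + [0]
    out = bytearray()
    for lo, hi in zip(codes[0::2], codes[1::2]):
        # Masking with 0x0F stores negative codes in 4-bit two's complement.
        out.append((lo & 0x0F) | ((hi & 0x0F) << 4))
    return bytes(out)


weights = [0.7, -0.3, 0.1, 0.0]
codes, scale = quantize_int4(weights)
packed = pack_int4(codes)          # 4 weights fit in 2 bytes
restored = dequantize_int4(codes, scale)
```

In this sketch four weights occupy two bytes instead of the 16 bytes needed at 32-bit precision, at the cost of a rounding error of at most half a scale step per value. The same trade-off is what makes quantizing weights and the KV cache attractive on machines with limited RAM.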