20.4K 2 weeks ago

Gemma4-turbo is part of the G4Turbo.com family to try and bring the Gemma 4 Model to everyone. Please visit https://g4turbo.com/ for more information about what I am doing.

vision tools thinking audio e2b e4b 12b 26b 31b
17eff2f85b7f · 330B
Gemma 4 Turbo e2b is the compact edition of the Gemma 4 Turbo family — optimized for machines with limited RAM (8GB+). Achieves maximum tokens-per-second on CPU through int4 quantization, KV cache quantization (Stage 1 TurboQuant), and turboquant inference tuning. Ideal for lightweight chat, tool calling, and edge deployments.