18.5K 2 weeks ago

Gemma4-turbo is part of the G4Turbo.com family to try and bring the Gemma 4 Model to everyone. Please visit https://g4turbo.com/ for more information about what I am doing.

vision tools thinking audio e2b e4b 26b 31b
053dc7e9e895 · 439B
Gemma 4 Turbo is a purpose-built quantization of Google's Gemma 4 (9B) model. Built from original bf16 source weights using IQ4_XS non-linear quantization (4.25 bpw) with full multimodal/vision capability preserved. 36% smaller than stock Ollama builds (6.1 GB vs 9.6 GB). Runs on CPU-only hardware — no GPU required. Part of the ash-server ecosystem (github.com/ssfdre38/ash-server). Submitted to the Kaggle Gemma 4 Good Hackathon 2026.