ssfdre38/gemma4-turbo/system

ssfdre38/ gemma4-turbo:latest

16.3K Downloads Updated 23 minutes ago

Gemma 4 Turbo is an optimized version of Google's Gemma 4 (9B) model, achieving 51% faster CPU inference through int4 quantization and performance tuning. Ideal for local AI assistants, tool calling, and chat applications on Windows systems without GPU.

vision tools thinking audio e2b e4b 26b 31b

system

64eb28e31c93 · 311B

Gemma 4 Turbo is an optimized version of Google's Gemma 4 (9B) model, achieving 51% faster CPU inference through int4 quantization, KV cache quantization (Stage 1 TurboQuant), and performance tuning. Ideal for local AI assistants, tool calling, and chat applications on Windows systems without GPU acceleration.