16.3K 24 minutes ago

Gemma 4 Turbo is an optimized version of Google's Gemma 4 (9B) model, achieving 51% faster CPU inference through int4 quantization and performance tuning. Ideal for local AI assistants, tool calling, and chat applications on Windows systems without GPU.

vision tools thinking audio e2b e4b 26b 31b
a09715380783 · 112B
{
"num_batch": 512,
"num_ctx": 16384,
"num_thread": 8,
"repeat_penalty": 1.1,
"temperature": 0.75,
"top_k": 40,
"top_p": 0.9
}