14 3 hours ago

Ultra-compressed Gemma 4 models (Q3_K_S quantization) optimized for mobile, edge, and resource-constrained environments. 50-57% smaller than stock models with 13% faster inference.

tools thinking e2b e4b 26b 31b
c56ad992639a · 202B
{{ if .System }}<start_of_turn>system
{{ .System }}<end_of_turn>
{{ end }}{{ if .Prompt }}<start_of_turn>user
{{ .Prompt }}<end_of_turn>
{{ end }}<start_of_turn>model
{{ .Response }}<end_of_turn>