Gemma is a family of lightweight, state-of-the-art open models from Google, built from the same research and technology used to create the Gemini models. This build uses IQ4_XS quantization.

vision 27b


Readme

| Feature | Value |
| --- | --- |
| vision | true (>=0.11.11) |
| thinking | false |
| tools | false |

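
Because vision is listed as supported, an image can be sent together with a prompt. The following is a minimal sketch, assuming a local Ollama server at its default address and its `/api/generate` endpoint. The model tag is a placeholder (this page does not state the exact tag), and `build_vision_request` and `send` are hypothetical helper names chosen for illustration.

```python
import base64
import json
import urllib.request

# Assumption: default local Ollama address; adjust if your server runs elsewhere.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Placeholder: replace with the actual tag of this model.
MODEL_TAG = "your-model-tag"


def build_vision_request(model: str, prompt: str, image_bytes: bytes,
                         num_ctx: int = 4096) -> dict:
    """Build the JSON body for one non-streaming request with a single image."""
    return {
        "model": model,
        "prompt": prompt,
        # Images are passed as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
        # Context window size in tokens (4096 and 15360 are the sizes measured here).
        "options": {"num_ctx": num_ctx},
    }


def send(body: dict, url: str = OLLAMA_URL) -> str:
    """POST the body to the server and return the generated text."""
    req = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

With a running server, `send(build_vision_request(MODEL_TAG, "Describe this image.", image_bytes))` would return the model's description, where `image_bytes` is the raw content of an image file. Since thinking and tools are listed as false, the sketch does not pass any related options.
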
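
| Device | Speed (tokens/s) | Context (tokens) | VRAM (GB) | Versions |
| --- | --- | --- | --- | --- |
| RTX 3090 24 GB | ~30 | 4096 | 17 | IQ4_XS, 0.12.2 |
| RTX 3090 24 GB | ~30 | 15360 | 19 | IQ4_XS, 0.12.2 |
| M1 Max 32 GB | ~14 | 4096 | 17 | IQ4_XS, 0.12.2 |
| M1 Max 32 GB | ~13 | 15360 | 18 | IQ4_XS, 0.12.2 |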
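
The measurements above give two context sizes per device, which is enough for a rough estimate of how memory grows with context. The sketch below assumes growth is linear between and beyond the two measured points, which the page does not confirm. The reported values are rounded to whole gigabytes, so the results are coarse. The function names are illustrative.

```python
# Measurements copied from the table: (context in tokens, VRAM in GB).
MEASUREMENTS = {
    "RTX 3090": [(4096, 17), (15360, 19)],
    "M1 Max": [(4096, 17), (15360, 18)],
}


def gb_per_1k_tokens(points):
    """Slope of VRAM versus context between the two points, in GB per 1024 tokens."""
    (ctx_a, vram_a), (ctx_b, vram_b) = points
    return (vram_b - vram_a) / (ctx_b - ctx_a) * 1024


def estimate_vram(points, context):
    """Linearly interpolate or extrapolate VRAM in GB for a context length."""
    (ctx_a, vram_a), _ = points
    return vram_a + gb_per_1k_tokens(points) * (context - ctx_a) / 1024


for device, points in MEASUREMENTS.items():
    print(
        f"{device}: ~{gb_per_1k_tokens(points):.2f} GB per 1024 tokens, "
        f"~{estimate_vram(points, 8192):.1f} GB at 8192 tokens"
    )
```

Treat the output as a planning aid only: actual usage depends on the runtime version, cache settings, and whatever else is occupying memory on the device.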