15.6K 1 month ago

Gemma 4 26B (IQ4_XS) - Optimized for 16GB VRAM

tools thinking
ollama run VladimirGav/gemma4-26b-16GB-VRAM

Details

1 month ago

c1c301c090d0 ยท 14GB ยท

gemma4
ยท
25.2B
ยท
IQ4_XS
{ "num_ctx": 8192, "num_gpu": 99, "stop": [ "<turn|>" ], "temperature":

Readme

Gemma 4 26B (IQ4_XS) - Optimized for 16GB VRAM

๐Ÿ”— Uncensored Version

Looking for the uncensored version? You can find it here: Gemma 4 26B (Uncensored)

This is a highly optimized version of Google Gemma 4 26B, specifically tailored to run on GPUs with 16GB of VRAM. It uses the advanced IQ4_XS (Importance Matrix) quantization to maintain high intelligence while fitting into a limited memory footprint.

๐Ÿš€ Key Features

  • VRAM Efficient: Occupies ~15GB, leaving about 1GB for context (KV Cache) on a 16GB card.

๐Ÿ’ป Target Hardware

Perfectly fits: * NVIDIA RTX 5060 Ti (16GB) * NVIDIA RTX 4070 Ti Super (16GB) * NVIDIA RTX 3080 (16GB version) * NVIDIA RTX 4080 / 4080 Super * NVIDIA RTX 5000โ„6000 Ada / A4000 (16GB)

๐Ÿ›  How to Use

Simply run the following command in your terminal: โ€œ`bash ollama run VladimirGav/gemma4-26b-16GB-VRAM