903 18 hours ago

Gemma 4 26B (IQ4_XS) - Optimized for 16GB VRAM

tools thinking
ollama run VladimirGav/gemma4-26b-16GB-VRAM

Details

0e46b99c680e · 14GB · gemma4 · 25.2B · IQ4_XS
{ "num_ctx": 8192, "stop": [ "<turn|>" ], "temperature": 1, "top_k": 64,

Readme

Gemma 4 26B (IQ4_XS) - Optimized for 16GB VRAM

This is a quantized build of Google Gemma 4 26B, tailored to run on GPUs with 16GB of VRAM. It uses IQ4_XS quantization, a roughly 4-bit i-quant format that is usually produced with an importance matrix, to preserve as much model quality as possible within a limited memory footprint.

🚀 Key Features

  • VRAM Efficient: Occupies ~15GB, leaving about 1GB for context (KV Cache) on a 16GB card.
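The memory budget above is simple subtraction, and it can be compared against what the GPU actually reports. The following is a minimal sketch, not part of the original README: `headroom_mib` is a hypothetical helper name, the 15GB figure is the README's own estimate, and the live check assumes an NVIDIA driver with `nvidia-smi` on the PATH (it reads the first GPU only).

```shell
#!/usr/bin/env bash
# headroom_mib TOTAL_MIB USED_MIB -> prints the MiB left over for the KV cache.
# (Hypothetical helper for illustration.)
headroom_mib() {
  echo $(( $1 - $2 ))
}

# Figures from the README: a 16 GiB card with roughly 15 GiB taken by the model.
headroom_mib 16384 15360   # prints 1024

# Optional live check on the first GPU; skipped when nvidia-smi is unavailable
# or does not return plain numbers.
if command -v nvidia-smi >/dev/null 2>&1; then
  line="$(nvidia-smi --query-gpu=memory.total,memory.used \
    --format=csv,noheader,nounits 2>/dev/null | head -n1)"
  IFS=', ' read -r total used <<< "$line"
  if [[ "$total" =~ ^[0-9]+$ && "$used" =~ ^[0-9]+$ ]]; then
    echo "free now: $(headroom_mib "$total" "$used") MiB"
  fi
fi
```

Running this while the model is loaded shows how much of the card is left for context. `ollama ps` additionally lists loaded models and how each is split between CPU and GPU, which helps spot a model that has spilled out of VRAM.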

💻 Target Hardware

Perfectly fits:

  • NVIDIA RTX 5060 Ti (16GB)

  • NVIDIA RTX 4070 Ti Super (16GB)

  • NVIDIA RTX 3080 (16GB version)

  • NVIDIA RTX 4080 / 4080 Super

  • NVIDIA RTX 5000/6000 Ada / A4000 (16GB)

🛠 How to Use

Simply run the following command in your terminal:

```bash
ollama run VladimirGav/gemma4-26b-16GB-VRAM
```
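For scripted use, the same model can also be queried through Ollama's local REST API instead of the interactive CLI. The sketch below is an illustration rather than part of the original README. It assumes the Ollama server is running on its default address, `OLLAMA_URL`, `build_payload`, and `ask` are hypothetical names, and the `options` values mirror only the parameters visible on the model page (`num_ctx`, `temperature`, `top_k`).

```shell
#!/usr/bin/env bash
# Sketch: call the model through Ollama's /api/generate endpoint.
MODEL="VladimirGav/gemma4-26b-16GB-VRAM"
OLLAMA_URL="${OLLAMA_URL:-http://localhost:11434}"   # assumed default address

# build_payload PROMPT -> prints a JSON request body for /api/generate.
# No JSON escaping is done here, so the prompt must not contain
# double quotes or backslashes.
build_payload() {
  printf '{"model":"%s","prompt":"%s","stream":false,"options":{"num_ctx":8192,"temperature":1,"top_k":64}}' \
    "$MODEL" "$1"
}

# ask PROMPT -> sends the request and prints the raw JSON response.
ask() {
  curl -s "$OLLAMA_URL/api/generate" -d "$(build_payload "$1")"
}
```

With the server running, `ask "Summarize what a KV cache does"` returns a single JSON object whose `response` field holds the generated text. Keeping `num_ctx` at 8192 matches the page's default; raising it enlarges the KV cache and may no longer fit in the roughly 1GB of headroom on a 16GB card.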