2,403 2 weeks ago

Gemma 4 Uncensored 26B (IQ4_XS) - Optimized for 16GB VRAM

tools thinking
ollama run VladimirGav/gemma4-26b-16GB-VRAM-Uncensored

Applications

Claude Code
Claude Code ollama launch claude --model VladimirGav/gemma4-26b-16GB-VRAM-Uncensored
OpenClaw
OpenClaw ollama launch openclaw --model VladimirGav/gemma4-26b-16GB-VRAM-Uncensored
Hermes Agent
Hermes Agent ollama launch hermes --model VladimirGav/gemma4-26b-16GB-VRAM-Uncensored
Codex
Codex ollama launch codex --model VladimirGav/gemma4-26b-16GB-VRAM-Uncensored
OpenCode
OpenCode ollama launch opencode --model VladimirGav/gemma4-26b-16GB-VRAM-Uncensored

Models

View all →

Readme

Gemma 4 26B Uncensored (IQ4_XS) - Optimized for 16GB VRAM Uncensored

This is a highly optimized, Uncensored version of Google Gemma 4 26B, specifically tailored to run on GPUs with 16GB of VRAM. It uses advanced IQ4_XS (Importance Matrix) quantization to maintain high intelligence and creativity while fitting into a limited memory footprint.

๐Ÿš€ Key Features

  • Uncensored: No pre-installed filters, moralizing, or refusal to answer. Perfect for roleplay, unrestricted creative writing, and deep research.
  • VRAM Efficient: Occupies ~15GB, leaving about 1GB for context (KV Cache) on a 16GB card.

๐Ÿ’ป Target Hardware

Perfectly fits: * NVIDIA RTX 5060 Ti (16GB) * NVIDIA RTX 4070 Ti Super (16GB) * NVIDIA RTX 3080 (16GB version) * NVIDIA RTX 4080 / 4080 Super * NVIDIA RTX 5000โ„6000 Ada / A4000 (16GB)

๐Ÿ›  How to Use

Simply run the following command in your terminal:

ollama run VladimirGav/gemma4-26b-uncensored-16GB-VRAM