128 7 months ago

vision tools 2b
ollama run ibm/granite3.2-vision:2b

Applications

Claude Code
Claude Code ollama launch claude --model ibm/granite3.2-vision:2b
Codex
Codex ollama launch codex --model ibm/granite3.2-vision:2b
OpenCode
OpenCode ollama launch opencode --model ibm/granite3.2-vision:2b
OpenClaw
OpenClaw ollama launch openclaw --model ibm/granite3.2-vision:2b

Models

View all →

Readme

Granite 3.2 Vision models

A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more. The model was trained on a meticulously curated instruction-following dataset, comprising diverse public datasets and synthetic datasets tailored to support a wide range of document understanding and general image tasks. It was trained by fine-tuning a Granite large language model with both image and text modalities.

Running

ollama run ibm/granite3.2-vision

Learn more