-
llama3.2-vision
Llama 3.2 Vision is a collection of instruction-tuned image reasoning generative models in 11B and 90B sizes.
vision 11b 90b4M Pulls 9 Tags Updated 10 months ago
-
granite3.2-vision
A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.
vision tools 2b821.5K Pulls 5 Tags Updated 1 year ago