vlm · Ollama Search

qwen3-vl

The most powerful vision-language model in the Qwen model family to date.

vision tools cloud 2b 4b 8b 30b 32b 235b

798.1K Pulls 59 Tags Updated 1 month ago

wizardlm-uncensored

Uncensored version of Wizard LM model

13b

114.5K Pulls 18 Tags Updated 2 years ago

llava

🌋 LLaVA is a novel end-to-end trained large multimodal model that combines a vision encoder and Vicuna for general-purpose visual and language understanding. Updated to version 1.6.

vision 7b 13b 34b

11.9M Pulls 98 Tags Updated 1 year ago

minicpm-v

A series of multimodal LLMs (MLLMs) designed for vision-language understanding.

vision 8b

4.1M Pulls 17 Tags Updated 1 year ago

llama3.2-vision

Llama 3.2 Vision is a collection of instruction-tuned image reasoning generative models in 11B and 90B sizes.

vision 11b 90b

3.3M Pulls 9 Tags Updated 7 months ago

deepseek-v3

A strong Mixture-of-Experts (MoE) language model with 671B total parameters with 37B activated for each token.

671b

3M Pulls 5 Tags Updated 11 months ago

bge-m3

BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.

embedding 567m

2.9M Pulls 3 Tags Updated 1 year ago

codegemma

CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

2b 7b

1.7M Pulls 85 Tags Updated 1 year ago

deepseek-coder-v2

An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

16b 236b

1.3M Pulls 64 Tags Updated 1 year ago

qwen2.5vl

Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.

vision 3b 7b 32b 72b

1.1M Pulls 17 Tags Updated 7 months ago

granite3.2-vision

A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

vision tools 2b

561.4K Pulls 5 Tags Updated 9 months ago

moondream

moondream2 is a small vision language model designed to run efficiently on edge devices.

vision 1.8b

454.8K Pulls 18 Tags Updated 1 year ago

hermes3

Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research

tools 3b 8b 70b 405b

370.7K Pulls 65 Tags Updated 1 year ago

vicuna

General use chat model based on Llama and Llama 2 with 2K to 16K context sizes.

7b 13b 33b

230.4K Pulls 111 Tags Updated 2 years ago

deepseek-v2

A strong, economical, and efficient Mixture-of-Experts language model.

16b 236b

227.2K Pulls 34 Tags Updated 1 year ago

orca2

Orca 2 is built by Microsoft research, and are a fine-tuned version of Meta's Llama 2 models. The model is designed to excel particularly in reasoning.

7b 13b

102.6K Pulls 33 Tags Updated 2 years ago

deepseek-ocr

DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

vision 3b

65.5K Pulls 3 Tags Updated 4 weeks ago

wizard-vicuna

Wizard Vicuna is a 13B parameter model based on Llama 2 trained by MelodysDreamj.

13b

64.3K Pulls 17 Tags Updated 2 years ago

command-r7b-arabic

A new state-of-the-art version of the lightweight Command R7B model that excels in advanced Arabic language capabilities for enterprises in the Middle East and Northern Africa.

tools 7b

42.5K Pulls 5 Tags Updated 9 months ago

cogito-2.1

The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

cloud 671b

18.7K Pulls 6 Tags Updated 3 weeks ago