Ollama

qwen2.5vl

The flagship vision-language model of the Qwen series and a significant leap from the previous Qwen2-VL.

vision 3b 7b 32b 72b

2M Pulls 17 Tags Updated 1 year ago

mistral-nemo

A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

tools 12b

4.7M Pulls 17 Tags Updated 10 months ago

qwen

Qwen 1.5 is a series of large language models by Alibaba Cloud, ranging from 0.5B to 110B parameters.

0.5b 1.8b 4b 7b 14b 32b 72b 110b

6.8M Pulls 379 Tags Updated 2 years ago

bge-m3

BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.

embedding 567m

4.5M Pulls 3 Tags Updated 1 year ago

smollm2

SmolLM2 is a family of compact language models available in three sizes: 135M, 360M, and 1.7B parameters.

tools 135m 360m 1.7b

3.4M Pulls 49 Tags Updated 1 year ago

granite3.1-moe

The IBM Granite 1B and 3B models are long-context mixture-of-experts (MoE) models designed for low-latency usage.

tools 1b 3b

2.9M Pulls 33 Tags Updated 1 year ago

cogito

Cogito v1 Preview is a family of hybrid reasoning models by Deep Cogito that outperform the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen across most standard benchmarks.

tools 3b 8b 14b 32b 70b

2M Pulls 20 Tags Updated 1 year ago

llama2

Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.

7b 13b 70b

7M Pulls 102 Tags Updated 2 years ago

falcon3

A family of efficient AI models under 10B parameters that perform well in science, math, and coding through innovative training techniques.

1b 3b 7b 10b

2.6M Pulls 17 Tags Updated 1 year ago

phi4-reasoning

Phi 4 reasoning and reasoning plus are 14-billion-parameter open-weight reasoning models that rival much larger models on complex reasoning tasks.

14b

1.6M Pulls 9 Tags Updated 1 year ago

mistral-small

Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.

tools 22b 24b

3M Pulls 21 Tags Updated 1 year ago

llama4

Meta's latest collection of multimodal models.

vision tools 16x17b 128x17b

1.7M Pulls 11 Tags Updated 11 months ago

tinyllama

The TinyLlama project is an open endeavor to train a compact 1.1B Llama model on 3 trillion tokens.

1.1b

5M Pulls 36 Tags Updated 2 years ago

qwq

QwQ is the reasoning model of the Qwen series.

tools 32b

2.3M Pulls 8 Tags Updated 1 year ago

codellama

A large language model that can use text prompts to generate and discuss code.

7b 13b 34b 70b

5.6M Pulls 199 Tags Updated 1 year ago

deepseek-coder

DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.

1.3b 6.7b 33b

4.2M Pulls 102 Tags Updated 2 years ago

snowflake-arctic-embed

A suite of text embedding models by Snowflake, optimized for performance.

embedding 22m 33m 110m 137m 335m

3.1M Pulls 16 Tags Updated 2 years ago

codegemma

CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks like fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

2b 7b

3M Pulls 85 Tags Updated 1 year ago

deepseek-coder-v2

An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

16b 236b

2.6M Pulls 64 Tags Updated 1 year ago

all-minilm

Embedding models trained on very large sentence-level datasets.

embedding 22m 33m

3.1M Pulls 10 Tags Updated 2 years ago