Ollama
  • qwen2.5vl

    Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.

    vision 3b 7b 32b 72b

    2M Pulls · 17 Tags · Updated 1 year ago

  • mistral-nemo

    A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

    tools 12b

    4.7M Pulls · 17 Tags · Updated 10 months ago

  • qwen

    Qwen 1.5 is a series of large language models by Alibaba Cloud, ranging from 0.5B to 110B parameters.

    0.5b 1.8b 4b 7b 14b 32b 72b 110b

    6.8M Pulls · 379 Tags · Updated 2 years ago

  • bge-m3

    BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.

    embedding 567m

    4.5M Pulls · 3 Tags · Updated 1 year ago

  • smollm2

    SmolLM2 is a family of compact language models available in three sizes: 135M, 360M, and 1.7B parameters.

    tools 135m 360m 1.7b

    3.4M Pulls · 49 Tags · Updated 1 year ago

  • granite3.1-moe

    The IBM Granite 1B and 3B models are long-context mixture-of-experts (MoE) models designed for low-latency use.

    tools 1b 3b

    2.9M Pulls · 33 Tags · Updated 1 year ago

  • cogito

    Cogito v1 Preview is a family of hybrid reasoning models by Deep Cogito that outperform the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen across most standard benchmarks.

    tools 3b 8b 14b 32b 70b

    2M Pulls · 20 Tags · Updated 1 year ago

  • llama2

    Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.

    7b 13b 70b

    7M Pulls · 102 Tags · Updated 2 years ago

  • falcon3

    A family of efficient AI models under 10B parameters that perform well in science, math, and coding through innovative training techniques.

    1b 3b 7b 10b

    2.6M Pulls · 17 Tags · Updated 1 year ago

  • phi4-reasoning

    Phi 4 reasoning and Phi 4 reasoning plus are 14-billion-parameter open-weight reasoning models that rival much larger models on complex reasoning tasks.

    14b

    1.6M Pulls · 9 Tags · Updated 1 year ago

  • mistral-small

    Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.

    tools 22b 24b

    3M Pulls · 21 Tags · Updated 1 year ago

  • llama4

    Meta's latest collection of multimodal models.

    vision tools 16x17b 128x17b

    1.7M Pulls · 11 Tags · Updated 11 months ago

  • tinyllama

    The TinyLlama project is an open endeavor to train a compact 1.1B Llama model on 3 trillion tokens.

    1.1b

    5M Pulls · 36 Tags · Updated 2 years ago

  • qwq

    QwQ is the reasoning model of the Qwen series.

    tools 32b

    2.3M Pulls · 8 Tags · Updated 1 year ago

  • codellama

    A large language model that can use text prompts to generate and discuss code.

    7b 13b 34b 70b

    5.6M Pulls · 199 Tags · Updated 1 year ago

  • deepseek-coder

    DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.

    1.3b 6.7b 33b

    4.2M Pulls · 102 Tags · Updated 2 years ago

  • snowflake-arctic-embed

    A suite of text embedding models by Snowflake, optimized for performance.

    embedding 22m 33m 110m 137m 335m

    3.1M Pulls · 16 Tags · Updated 2 years ago

  • codegemma

    CodeGemma is a collection of powerful, lightweight models that can perform a variety of coding tasks, such as fill-in-the-middle code completion, code generation, natural language understanding, mathematical reasoning, and instruction following.

    2b 7b

    3M Pulls · 85 Tags · Updated 1 year ago

  • deepseek-coder-v2

    An open-source Mixture-of-Experts code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks.

    16b 236b

    2.6M Pulls · 64 Tags · Updated 1 year ago

  • all-minilm

    Embedding models trained on very large sentence-level datasets.

    embedding 22m 33m

    3.1M Pulls · 10 Tags · Updated 2 years ago

© 2026 Ollama