Ollama
Models GitHub Discord Turbo
Sign in Download
Models Download GitHub Discord Sign in
⇅
Tools models · Ollama Search
Search for Tools models on Ollama.
  • deepseek-v3.1

    DeepSeek-V3.1 is a hybrid model that supports both thinking mode and non-thinking mode.

    tools thinking 671b

    33.4K  Pulls 4  Tags Updated  1 week ago

  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking 20b 120b

    2M  Pulls 3  Tags Updated  3 weeks ago

  • mistral-small3.2

    An update to Mistral Small that improves on function calling, instruction following, and less repetition errors.

    vision tools 24b

    264.7K  Pulls 5  Tags Updated  2 months ago

  • magistral

    Magistral is a small, efficient reasoning model with 24B parameters.

    tools thinking 24b

    346.6K  Pulls 5  Tags Updated  2 months ago

  • devstral

    Devstral: the best open source model for coding agents

    tools 24b

    323.8K  Pulls 5  Tags Updated  2 months ago

  • qwen3

    Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

    tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

    7.6M  Pulls 56  Tags Updated  1 month ago

  • granite3.3

    IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.

    tools 2b 8b

    432.1K  Pulls 3  Tags Updated  4 months ago

  • mistral-small3.1

    Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.

    vision tools 24b

    258K  Pulls 5  Tags Updated  5 months ago

  • cogito

    Cogito v1 Preview is a family of hybrid reasoning models by Deep Cogito that outperform the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen across most standard benchmarks.

    tools 3b 8b 14b 32b 70b

    361.4K  Pulls 20  Tags Updated  5 months ago

  • llama4

    Meta's latest collection of multimodal models.

    vision tools 16x17b 128x17b

    655.9K  Pulls 11  Tags Updated  2 months ago

  • command-a

    111 billion parameter model optimized for demanding enterprises that require fast, secure, and high-quality AI

    tools 111b

    53.9K  Pulls 5  Tags Updated  5 months ago

  • command-r7b-arabic

    A new state-of-the-art version of the lightweight Command R7B model that excels in advanced Arabic language capabilities for enterprises in the Middle East and Northern Africa.

    tools 7b

    17.4K  Pulls 5  Tags Updated  6 months ago

  • granite3.2-vision

    A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

    vision tools 2b

    282.3K  Pulls 5  Tags Updated  6 months ago

  • phi4-mini

    Phi-4-mini brings significant enhancements in multilingual support, reasoning, and mathematics, and now, the long-awaited function calling feature is finally supported.

    tools 3.8b

    340.1K  Pulls 5  Tags Updated  6 months ago

  • granite3.2

    Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.

    tools 2b 8b

    152K  Pulls 9  Tags Updated  6 months ago

  • deepseek-r1

    DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.

    tools thinking 1.5b 7b 8b 14b 32b 70b 671b

    61M  Pulls 35  Tags Updated  2 months ago

  • command-r7b

    The smallest model in Cohere's R series delivers top-tier speed, efficiency, and quality to build powerful AI applications on commodity GPUs and edge devices.

    tools 7b

    55.1K  Pulls 5  Tags Updated  7 months ago

  • granite3.1-dense

    The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.

    tools 2b 8b

    116.7K  Pulls 33  Tags Updated  7 months ago

  • granite3.1-moe

    The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.

    tools 1b 3b

    454.5K  Pulls 33  Tags Updated  7 months ago

  • llama3.3

    New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.

    tools 70b

    2.4M  Pulls 14  Tags Updated  9 months ago

  • qwq

    QwQ is the reasoning model of the Qwen series.

    tools 32b

    1.6M  Pulls 8  Tags Updated  5 months ago

  • athene-v2

    Athene-V2 is a 72B parameter model which excels at code completion, mathematics, and log extraction tasks.

    tools 72b

    95.5K  Pulls 17  Tags Updated  9 months ago

  • smollm2

    SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters.

    tools 135m 360m 1.7b

    1.6M  Pulls 49  Tags Updated  10 months ago

  • aya-expanse

    Cohere For AI's language models trained to perform well across 23 different languages.

    tools 8b 32b

    81.1K  Pulls 33  Tags Updated  10 months ago

  • granite3-dense

    The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

    tools 2b 8b

    116.5K  Pulls 33  Tags Updated  9 months ago

  • granite3-moe

    The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.

    tools 1b 3b

    84K  Pulls 33  Tags Updated  9 months ago

  • nemotron

    Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

    tools 70b

    92.6K  Pulls 17  Tags Updated  10 months ago

  • llama3.2

    Meta's Llama 3.2 goes small with 1B and 3B models.

    tools 1b 3b

    33.8M  Pulls 63  Tags Updated  11 months ago

  • qwen2.5-coder

    The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

    tools 0.5b 1.5b 3b 7b 14b 32b

    6.8M  Pulls 199  Tags Updated  3 months ago

  • nemotron-mini

    A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.

    tools 4b

    102.9K  Pulls 17  Tags Updated  11 months ago

  • qwen2.5

    Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.

    tools 0.5b 1.5b 3b 7b 14b 32b 72b

    13.5M  Pulls 133  Tags Updated  11 months ago

  • mistral-small

    Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.

    tools 22b 24b

    1.7M  Pulls 21  Tags Updated  7 months ago

  • hermes3

    Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research

    tools 3b 8b 70b 405b

    333.8K  Pulls 65  Tags Updated  8 months ago

  • mistral-large

    Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.

    tools 123b

    254.7K  Pulls 32  Tags Updated  9 months ago

  • llama3.1

    Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.

    tools 8b 70b 405b

    101.8M  Pulls 93  Tags Updated  9 months ago

  • mistral-nemo

    A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

    tools 12b

    2.5M  Pulls 17  Tags Updated  1 month ago

  • firefunction-v2

    An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.

    tools 70b

    32.2K  Pulls 17  Tags Updated  1 year ago

  • llama3-groq-tool-use

    A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.

    tools 8b 70b

    91.6K  Pulls 33  Tags Updated  1 year ago

  • qwen2

    Qwen2 is a new series of large language models from Alibaba group

    tools 0.5b 1.5b 7b 72b

    4.3M  Pulls 97  Tags Updated  12 months ago

  • command-r-plus

    Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.

    tools 104b

    140.5K  Pulls 21  Tags Updated  1 year ago

  • command-r

    Command R is a Large Language Model optimized for conversational interaction and long context tasks.

    tools 35b

    340.5K  Pulls 32  Tags Updated  1 year ago

  • mixtral

    A set of Mixture of Experts (MoE) model with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.

    tools 8x7b 8x22b

    1.3M  Pulls 70  Tags Updated  8 months ago

  • mistral

    The 7B model released by Mistral AI, updated to version 0.3.

    tools 7b

    18.9M  Pulls 84  Tags Updated  1 month ago

© 2025 Ollama
Download Blog Docs GitHub Discord X (Twitter) Contact Us
  • Blog
  • Download
  • Docs
  • GitHub
  • Discord
  • X (Twitter)
  • Meetups
© 2025 Ollama Inc.