Ollama
Models GitHub Discord Docs Cloud
Sign in Download
Models Download GitHub Discord Docs Cloud Sign in
⇅
Tools models · Ollama Search
Search for Tools models on Ollama.
  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking cloud 20b 120b

    3.9M  Pulls 5  Tags Updated  2 weeks ago

  • deepseek-r1

    DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.

    tools thinking 1.5b 7b 8b 14b 32b 70b 671b

    68.5M  Pulls 35  Tags Updated  4 months ago

  • qwen3

    Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

    tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

    11.8M  Pulls 58  Tags Updated  2 weeks ago

  • deepseek-v3.1

    DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

    tools thinking cloud 671b

    121.9K  Pulls 8  Tags Updated  1 month ago

  • llama3.1

    Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.

    tools 8b 70b 405b

    105.1M  Pulls 93  Tags Updated  11 months ago

  • llama3.2

    Meta's Llama 3.2 goes small with 1B and 3B models.

    tools 1b 3b

    42.7M  Pulls 63  Tags Updated  1 year ago

  • mistral

    The 7B model released by Mistral AI, updated to version 0.3.

    tools 7b

    21.4M  Pulls 84  Tags Updated  3 months ago

  • qwen2.5

    Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.

    tools 0.5b 1.5b 3b 7b 14b 32b 72b

    15.9M  Pulls 133  Tags Updated  1 year ago

  • qwen2.5-coder

    The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

    tools 0.5b 1.5b 3b 7b 14b 32b

    8M  Pulls 199  Tags Updated  5 months ago

  • qwen2

    Qwen2 is a new series of large language models from Alibaba group

    tools 0.5b 1.5b 7b 72b

    4.4M  Pulls 97  Tags Updated  1 year ago

  • mistral-nemo

    A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.

    tools 12b

    2.8M  Pulls 17  Tags Updated  3 months ago

  • llama3.3

    New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.

    tools 70b

    2.7M  Pulls 14  Tags Updated  10 months ago

  • mistral-small

    Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.

    tools 22b 24b

    2.1M  Pulls 21  Tags Updated  9 months ago

  • smollm2

    SmolLM2 is a family of compact language models available in three size: 135M, 360M, and 1.7B parameters.

    tools 135m 360m 1.7b

    2M  Pulls 49  Tags Updated  12 months ago

  • qwq

    QwQ is the reasoning model of the Qwen series.

    tools 32b

    1.7M  Pulls 8  Tags Updated  7 months ago

  • mixtral

    A set of Mixture of Experts (MoE) model with open weights by Mistral AI in 8x7b and 8x22b parameter sizes.

    tools 8x7b 8x22b

    1.4M  Pulls 70  Tags Updated  10 months ago

  • granite3.1-moe

    The IBM Granite 1B and 3B models are long-context mixture of experts (MoE) Granite models from IBM designed for low latency usage.

    tools 1b 3b

    1M  Pulls 33  Tags Updated  9 months ago

  • llama4

    Meta's latest collection of multimodal models.

    vision tools 16x17b 128x17b

    755.7K  Pulls 11  Tags Updated  4 months ago

  • mistral-small3.2

    An update to Mistral Small that improves on function calling, instruction following, and less repetition errors.

    vision tools 24b

    719.7K  Pulls 5  Tags Updated  4 months ago

  • granite3.3

    IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.

    tools 2b 8b

    665K  Pulls 3  Tags Updated  6 months ago

  • cogito

    Cogito v1 Preview is a family of hybrid reasoning models by Deep Cogito that outperform the best available open models of the same size, including counterparts from LLaMA, DeepSeek, and Qwen across most standard benchmarks.

    tools 3b 8b 14b 32b 70b

    663K  Pulls 20  Tags Updated  6 months ago

  • magistral

    Magistral is a small, efficient reasoning model with 24B parameters.

    tools thinking 24b

    587.9K  Pulls 5  Tags Updated  4 months ago

  • phi4-mini

    Phi-4-mini brings significant enhancements in multilingual support, reasoning, and mathematics, and now, the long-awaited function calling feature is finally supported.

    tools 3.8b

    462.2K  Pulls 5  Tags Updated  8 months ago

  • devstral

    Devstral: the best open source model for coding agents

    tools 24b

    425.5K  Pulls 5  Tags Updated  3 months ago

  • granite3.2-vision

    A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

    vision tools 2b

    414.8K  Pulls 5  Tags Updated  8 months ago

  • command-r

    Command R is a Large Language Model optimized for conversational interaction and long context tasks.

    tools 35b

    361.6K  Pulls 32  Tags Updated  1 year ago

  • mistral-small3.1

    Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.

    vision tools 24b

    345.7K  Pulls 5  Tags Updated  6 months ago

  • hermes3

    Hermes 3 is the latest version of the flagship Hermes series of LLMs by Nous Research

    tools 3b 8b 70b 405b

    343.5K  Pulls 65  Tags Updated  10 months ago

  • mistral-large

    Mistral Large 2 is Mistral's new flagship model that is significantly more capable in code generation, mathematics, and reasoning with 128k context window and support for dozens of languages.

    tools 123b

    266.2K  Pulls 32  Tags Updated  11 months ago

  • granite3.2

    Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.

    tools 2b 8b

    162.6K  Pulls 9  Tags Updated  8 months ago

  • command-r-plus

    Command R+ is a powerful, scalable large language model purpose-built to excel at real-world enterprise use cases.

    tools 104b

    145.7K  Pulls 21  Tags Updated  1 year ago

  • granite3-dense

    The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

    tools 2b 8b

    128K  Pulls 33  Tags Updated  11 months ago

  • granite3.1-dense

    The IBM Granite 2B and 8B models are text-only dense LLMs trained on over 12 trillion tokens of data, demonstrated significant improvements over their predecessors in performance and speed in IBM’s initial testing.

    tools 2b 8b

    124K  Pulls 33  Tags Updated  9 months ago

  • nemotron-mini

    A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.

    tools 4b

    109.8K  Pulls 17  Tags Updated  1 year ago

  • athene-v2

    Athene-V2 is a 72B parameter model which excels at code completion, mathematics, and log extraction tasks.

    tools 72b

    100.6K  Pulls 17  Tags Updated  11 months ago

  • llama3-groq-tool-use

    A series of models from Groq that represent a significant advancement in open-source AI capabilities for tool use/function calling.

    tools 8b 70b

    100.3K  Pulls 33  Tags Updated  1 year ago

  • nemotron

    Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM generated responses to user queries.

    tools 70b

    97.8K  Pulls 17  Tags Updated  1 year ago

  • granite3-moe

    The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.

    tools 1b 3b

    92.2K  Pulls 33  Tags Updated  11 months ago

  • aya-expanse

    Cohere For AI's language models trained to perform well across 23 different languages.

    tools 8b 32b

    88.3K  Pulls 33  Tags Updated  1 year ago

  • command-r7b

    The smallest model in Cohere's R series delivers top-tier speed, efficiency, and quality to build powerful AI applications on commodity GPUs and edge devices.

    tools 7b

    69.4K  Pulls 5  Tags Updated  9 months ago

  • granite4

    Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.

    tools 350m 1b 3b

    69.2K  Pulls 17  Tags Updated  yesterday

  • command-a

    111 billion parameter model optimized for demanding enterprises that require fast, secure, and high-quality AI

    tools 111b

    60.4K  Pulls 5  Tags Updated  7 months ago

  • firefunction-v2

    An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.

    tools 70b

    36.7K  Pulls 17  Tags Updated  1 year ago

  • command-r7b-arabic

    A new state-of-the-art version of the lightweight Command R7B model that excels in advanced Arabic language capabilities for enterprises in the Middle East and Northern Africa.

    tools 7b

    21.5K  Pulls 5  Tags Updated  8 months ago

  • gpt-oss-safeguard

    gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built-upon gpt-oss

    tools thinking 20b 120b

    675  Pulls 3  Tags Updated  yesterday

© 2025 Ollama
Download Blog Docs GitHub Discord X (Twitter) Contact Us
  • Blog
  • Download
  • Docs
  • GitHub
  • Discord
  • X (Twitter)
  • Meetups
© 2025 Ollama Inc.