Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Thinking models · Ollama
Thinking models on Ollama.
  • mistral-medium-3.5

    Mistral Medium 3.5 is the first flagship model of Mistral AI that merged instruction-following, reasoning, and coding in a single set of 128B weights.

    vision tools thinking 128b

    10K  Pulls 5  Tags Updated  6 days ago

  • nemotron3

    NVIDIA Nemotron 3 Nano Omni is a multimodal large language model that unifies video, audio, image, and text understanding to support enterprise-grade Q&A, summarization, transcription, and document intelligence workflows.

    vision tools thinking audio 33b

    426.8K  Pulls 4  Tags Updated  1 week ago

  • qwen3.6

    Qwen3.6 delivers substantial upgrades in agentic coding and thinking preservation than previous Qwen models.

    vision tools thinking 27b 35b

    825.8K  Pulls 22  Tags Updated  1 week ago

  • kimi-k2.6

    Kimi K2.6 is an open-source, native multimodal agentic model that advances practical capabilities in long-horizon coding, coding-driven design, proactive autonomous execution, and swarm-based task orchestration.

    vision tools thinking cloud

    133.6K  Pulls 1  Tag Updated  2 weeks ago

  • glm-5.1

    GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.

    tools thinking cloud

    181.6K  Pulls 1  Tag Updated  3 weeks ago

  • deepseek-v4-flash

    DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.

    tools thinking cloud

    43.8K  Pulls 1  Tag Updated  1 week ago

  • deepseek-v4-pro

    DeepSeek-V4-Pro is a frontier Mixture-of-Experts model with a 1M-token context window and three reasoning modes.

    tools thinking cloud

    34.6K  Pulls 1  Tag Updated  1 week ago

  • laguna-xs.2

    Laguna XS.2 is a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token designed for agentic coding and long-horizon work on a local machine.

    tools thinking

    5,637  Pulls 7  Tags Updated  1 week ago

  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools thinking audio cloud e2b e4b 26b 31b

    6.8M  Pulls 29  Tags Updated  2 weeks ago

  • qwen3.5

    Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

    vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

    8.3M  Pulls 58  Tags Updated  1 month ago

  • glm-4.7-flash

    As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

    tools thinking

    1.2M  Pulls 4  Tags Updated  3 months ago

  • nemotron-3-super

    NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

    tools thinking cloud 120b

    325.1K  Pulls 7  Tags Updated  1 month ago

  • minimax-m2.7

    MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

    tools thinking cloud

    158.9K  Pulls 1  Tag Updated  1 month ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    251.6K  Pulls 1  Tag Updated  2 months ago

  • qwen3-next

    The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

    tools thinking cloud 80b

    536.4K  Pulls 10  Tags Updated  4 months ago

  • nemotron-cascade-2

    An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

    tools thinking 30b

    110.2K  Pulls 3  Tags Updated  1 month ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    tools thinking cloud

    222.3K  Pulls 1  Tag Updated  2 months ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

    vision tools thinking cloud

    270.1K  Pulls 1  Tag Updated  3 months ago

  • nemotron-3-nano

    Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

    tools thinking cloud 4b 30b

    422.2K  Pulls 9  Tags Updated  1 month ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    vision tools thinking cloud

    199.5K  Pulls 2  Tags Updated  4 months ago

© 2026 Ollama
Blog Contact