Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Thinking models · Ollama
Thinking models on Ollama.
  • glm-5.1

    GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.

    tools thinking cloud

    24.9K  Pulls 1  Tag Updated  4 days ago

  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools thinking audio cloud e2b e4b 26b 31b

    2.5M  Pulls 17  Tags Updated  6 days ago

  • nemotron-cascade-2

    An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

    tools thinking 30b

    89.5K  Pulls 3  Tags Updated  3 weeks ago

  • minimax-m2.7

    MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

    tools thinking cloud

    65.4K  Pulls 1  Tag Updated  3 weeks ago

  • qwen3.5

    Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

    vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

    5.8M  Pulls 58  Tags Updated  1 week ago

  • glm-4.7-flash

    As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

    tools thinking

    1.1M  Pulls 4  Tags Updated  2 months ago

  • nemotron-3-super

    NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

    tools thinking cloud 120b

    223.6K  Pulls 7  Tags Updated  1 month ago

  • qwen3-next

    The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

    tools thinking cloud 80b

    514.3K  Pulls 10  Tags Updated  4 months ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    182.2K  Pulls 1  Tag Updated  1 month ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

    vision tools thinking cloud

    236.4K  Pulls 1  Tag Updated  2 months ago

  • nemotron-3-nano

    Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

    tools thinking cloud 4b 30b

    383.8K  Pulls 9  Tags Updated  3 weeks ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    tools thinking cloud

    159.9K  Pulls 1  Tag Updated  1 month ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    vision tools thinking cloud

    129.2K  Pulls 2  Tags Updated  3 months ago

  • glm-4.7

    Advancing the Coding Capability

    tools thinking cloud

    87.9K  Pulls 1  Tag Updated  3 months ago

  • gpt-oss-safeguard

    gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built-upon gpt-oss

    tools thinking 20b 120b

    130.8K  Pulls 3  Tags Updated  5 months ago

  • deepseek-v3.2

    DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

    tools thinking cloud

    76.4K  Pulls 1  Tag Updated  3 months ago

  • minimax-m2

    MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

    tools thinking cloud

    99.4K  Pulls 1  Tag Updated  5 months ago

  • kimi-k2-thinking

    Kimi K2 Thinking, Moonshot AI's best open-source thinking model.

    tools thinking cloud

    54K  Pulls 1  Tag Updated  5 months ago

  • qwen3

    Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

    tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

    26.5M  Pulls 58  Tags Updated  6 months ago

  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking cloud 20b 120b

    8.7M  Pulls 5  Tags Updated  6 months ago

© 2026 Ollama
Blog Contact