Ollama
Tools models · Ollama
Tools models on Ollama.
  • nemotron-cascade-2

    An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

    tools thinking 30b

    51.3K Pulls · 3 Tags · Updated 1 week ago

  • minimax-m2.7

    MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

    tools thinking cloud

    46.3K Pulls · 1 Tag · Updated 2 weeks ago

  • gemma4

    Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

    vision tools audio e2b e4b 26b 31b

    5,433 Pulls · 16 Tags · Updated 5 minutes ago

  • lfm2

    LFM2 is a family of hybrid models designed for on-device deployment. LFM2-24B-A2B is the largest model in the family, scaling the architecture to 24 billion parameters while keeping inference efficient.

    tools 24b

    1M Pulls · 6 Tags · Updated 1 month ago

  • nemotron-3-super

    NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

    tools thinking cloud 120b

    173.3K Pulls · 7 Tags · Updated 3 weeks ago

  • qwen3.5

    Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

    vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

    4.5M Pulls · 58 Tags · Updated 20 hours ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    152.8K Pulls · 1 Tag · Updated 1 month ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    tools thinking cloud

    151.9K Pulls · 1 Tag · Updated 1 month ago

  • qwen3-coder-next

    Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

    tools cloud

    989.8K Pulls · 4 Tags · Updated 1 month ago

  • glm-ocr

    GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

    vision tools

    175.3K Pulls · 3 Tags · Updated 1 month ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, natively multimodal agentic model that integrates vision and language understanding with advanced agentic capabilities. It supports instant and thinking modes as well as conversational and agentic paradigms.

    vision tools thinking cloud

    211.4K Pulls · 1 Tag · Updated 2 months ago

  • lfm2.5-thinking

    LFM2.5 is a new family of hybrid models designed for on-device deployment.

    tools 1.2b

    1.1M Pulls · 5 Tags · Updated 2 months ago

  • glm-4.7-flash

    As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

    tools thinking

    1M Pulls · 4 Tags · Updated 2 months ago

  • glm-4.7

    Advancing the Coding Capability

    tools thinking cloud

    81.1K Pulls · 1 Tag · Updated 3 months ago

  • minimax-m2.1

    Exceptional multilingual capabilities to elevate code engineering

    tools cloud

    34.4K Pulls · 1 Tag · Updated 3 months ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    vision tools thinking cloud

    115.1K Pulls · 2 Tags · Updated 3 months ago

  • nemotron-3-nano

    Nemotron-3-Nano sets a new standard for efficient, open, and intelligent agentic models, and now includes a 4B-parameter model.

    tools thinking cloud 4b 30b

    333.2K Pulls · 9 Tags · Updated 2 weeks ago

  • functiongemma

    FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

    tools 270m

    126.9K Pulls · 4 Tags · Updated 3 months ago

  • olmo-3.1

    Olmo is a series of open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

    tools 32b

    215.2K Pulls · 10 Tags · Updated 3 months ago

  • devstral-small-2

    A 24B model that excels at using tools to explore codebases, edit multiple files, and power software engineering agents.

    vision tools cloud 24b

    730.8K Pulls · 6 Tags · Updated 3 months ago

© 2026 Ollama