Ollama
Cloud models · Ollama
Cloud models on Ollama.
  • devstral-2

    A 123B model that excels at using tools to explore codebases, edit multiple files, and power software engineering agents.

    tools cloud 123b

    230.2K Pulls · 6 Tags · Updated 5 months ago

  • qwen3-next

    The first installment in the Qwen3-Next series, with strong parameter efficiency and inference speed.

    tools thinking cloud 80b

    560.9K Pulls · 10 Tags · Updated 5 months ago

  • mistral-large-3

    A general-purpose multimodal mixture-of-experts model for production-grade tasks and enterprise workloads.

    vision tools cloud

    59.8K Pulls · 1 Tag · Updated 6 months ago

  • ministral-3

    The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

    vision tools cloud 3b 8b 14b

    1.2M Pulls · 16 Tags · Updated 5 months ago

  • cogito-2.1

    The Cogito v2.1 LLMs are instruction-tuned generative models. All models are released under the MIT license for commercial use.

    cloud 671b

    194.6K Pulls · 6 Tags · Updated 6 months ago

  • kimi-k2-thinking

    Kimi K2 Thinking, Moonshot AI's best open-source thinking model.

    tools thinking cloud

    2.1M Pulls · 1 Tag · Updated 6 months ago

  • minimax-m2

    MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

    tools thinking cloud

    2.2M Pulls · 1 Tag · Updated 7 months ago

  • glm-4.6

    Advanced agentic, reasoning, and coding capabilities.

    tools thinking cloud

    2.2M Pulls · 1 Tag · Updated 7 months ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools thinking cloud 2b 4b 8b 30b 32b 235b

    4M Pulls · 59 Tags · Updated 7 months ago

  • kimi-k2

    A state-of-the-art mixture-of-experts (MoE) language model. Kimi K2-Instruct-0905 demonstrates significant improvements in performance on public benchmarks and real-world coding agent tasks.

    tools cloud

    75K Pulls · 1 Tag · Updated 8 months ago

  • deepseek-v3.1

    DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

    tools thinking cloud 671b

    692K Pulls · 8 Tags · Updated 8 months ago

  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking cloud 20b 120b

    10M Pulls · 5 Tags · Updated 7 months ago

  • qwen3-coder

    Alibaba's performant long-context models for agentic and coding tasks.

    tools cloud 30b 480b

    5.8M Pulls · 10 Tags · Updated 8 months ago

  • gemma3

    The current, most capable model that runs on a single GPU.

    vision cloud 270m 1b 4b 12b 27b

    37.3M Pulls · 29 Tags · Updated 5 months ago

© 2026 Ollama