Ollama
Models GitHub Discord Docs Cloud
Sign in Download
Models Download GitHub Discord Docs Cloud Sign in
⇅
Cloud models · Ollama Search
Search for Cloud models on Ollama.
  • glm-4.7

    Advancing the Coding Capability

    cloud

    3,217  Pulls 1  Tag Updated  4 days ago

  • minimax-m2.1

    Exceptional multilingual capabilities to elevate code engineering

    cloud

    2,108  Pulls 1  Tag Updated  4 days ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    cloud

    12.4K  Pulls 2  Tags Updated  1 week ago

  • nemotron-3-nano

    Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

    cloud 30b

    51.3K  Pulls 6  Tags Updated  1 week ago

  • devstral-small-2

    24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

    vision tools cloud 24b

    54.4K  Pulls 6  Tags Updated  2 weeks ago

  • rnj-1

    Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.

    tools cloud 8b

    16.8K  Pulls 6  Tags Updated  2 weeks ago

  • deepseek-v3.2

    DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

    cloud

    7,086  Pulls 1  Tag Updated  1 week ago

  • devstral-2

    123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

    tools cloud 123b

    20.9K  Pulls 6  Tags Updated  2 weeks ago

  • qwen3-next

    The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

    tools thinking cloud 80b

    218.2K  Pulls 10  Tags Updated  2 weeks ago

  • mistral-large-3

    A general-purpose multimodal mixture-of-experts model for production-grade tasks and enterprise workloads.

    cloud

    6,148  Pulls 1  Tag Updated  3 weeks ago

  • ministral-3

    The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

    vision tools cloud 3b 8b 14b

    146K  Pulls 16  Tags Updated  2 weeks ago

  • cogito-2.1

    The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

    cloud 671b

    24.2K  Pulls 6  Tags Updated  1 month ago

  • gemini-3-pro-preview

    Google's most intelligent model with SOTA reasoning and multimodal understanding, and powerful agentic and vibe coding capabilities.

    cloud

    45.6K  Pulls 1  Tag Updated  1 month ago

  • kimi-k2-thinking

    Kimi K2 Thinking, Moonshot AI's best open-source thinking model.

    cloud

    16.1K  Pulls 1  Tag Updated  1 month ago

  • minimax-m2

    MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

    cloud

    30.9K  Pulls 1  Tag Updated  2 months ago

  • glm-4.6

    Advanced agentic, reasoning and coding capabilities.

    cloud

    37.7K  Pulls 1  Tag Updated  2 months ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools cloud 2b 4b 8b 30b 32b 235b

    867.1K  Pulls 59  Tags Updated  1 month ago

  • kimi-k2

    A state-of-the-art mixture-of-experts (MoE) language model. Kimi K2-Instruct-0905 demonstrates significant improvements in performance on public benchmarks and real-world coding agent tasks.

    cloud

    23K  Pulls 1  Tag Updated  3 months ago

  • deepseek-v3.1

    DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

    tools thinking cloud 671b

    222.7K  Pulls 8  Tags Updated  3 months ago

  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking cloud 20b 120b

    5.5M  Pulls 5  Tags Updated  2 months ago

  • qwen3-coder

    Alibaba's performant long context models for agentic and coding tasks.

    tools cloud 30b 480b

    1.5M  Pulls 10  Tags Updated  3 months ago

  • gemma3

    The current, most capable model that runs on a single GPU.

    vision cloud 270m 1b 4b 12b 27b

    28.8M  Pulls 29  Tags Updated  3 weeks ago

© 2025 Ollama
Download Blog Docs GitHub Discord X (Twitter) Contact Us
  • Blog
  • Download
  • Docs
  • GitHub
  • Discord
  • X (Twitter)
  • Meetups
© 2025 Ollama Inc.