Cloud models · Ollama

devstral-2

123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

tools cloud 123b

230.2K Pulls 6 Tags Updated 5 months ago

qwen3-next

The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

tools thinking cloud 80b

560.9K Pulls 10 Tags Updated 5 months ago

mistral-large-3

A general-purpose multimodal mixture-of-experts model for production-grade tasks and enterprise workloads.

vision tools cloud

59.8K Pulls 1 Tag Updated 6 months ago

ministral-3

The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

vision tools cloud 3b 8b 14b

1.2M Pulls 16 Tags Updated 5 months ago

cogito-2.1

The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

cloud 671b

194.6K Pulls 6 Tags Updated 6 months ago

kimi-k2-thinking

Kimi K2 Thinking, Moonshot AI's best open-source thinking model.

tools thinking cloud

2.1M Pulls 1 Tag Updated 6 months ago

minimax-m2

MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

tools thinking cloud

2.2M Pulls 1 Tag Updated 7 months ago

glm-4.6

Advanced agentic, reasoning and coding capabilities.

tools thinking cloud

2.2M Pulls 1 Tag Updated 7 months ago

qwen3-vl

The most powerful vision-language model in the Qwen model family to date.

vision tools thinking cloud 2b 4b 8b 30b 32b 235b

4M Pulls 59 Tags Updated 7 months ago

kimi-k2

A state-of-the-art mixture-of-experts (MoE) language model. Kimi K2-Instruct-0905 demonstrates significant improvements in performance on public benchmarks and real-world coding agent tasks.

tools cloud

75K Pulls 1 Tag Updated 8 months ago

deepseek-v3.1

DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

tools thinking cloud 671b

692K Pulls 8 Tags Updated 8 months ago

gpt-oss

OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

tools thinking cloud 20b 120b

10M Pulls 5 Tags Updated 7 months ago

qwen3-coder

Alibaba's performant long context models for agentic and coding tasks.

tools cloud 30b 480b

5.8M Pulls 10 Tags Updated 8 months ago

gemma3

The current, most capable model that runs on a single GPU.

vision cloud 270m 1b 4b 12b 27b

37.3M Pulls 29 Tags Updated 5 months ago