Tools models · Ollama

kimi-k2.6

Kimi K2.6 is an open-source, native multimodal agentic model that advances practical capabilities in long-horizon coding, coding-driven design, proactive autonomous execution, and swarm-based task orchestration.

vision tools thinking cloud

48.3K Pulls 1 Tag Updated 6 days ago

deepseek-v4-flash

DeepSeek-V4-Flash is a preview of the DeepSeek-V4 series, a Mixture-of-Experts model with 284B total parameters and 13B activated, built for efficient reasoning across a 1M-token context window.

tools thinking cloud

17.8K Pulls 1 Tag Updated 2 days ago

qwen3.6

Qwen3.6 delivers substantial upgrades in agentic coding and thinking preservation over previous Qwen models.

vision tools thinking 27b 35b

535.2K Pulls 22 Tags Updated 4 days ago

glm-5.1

GLM-5.1 is our next-generation flagship model for agentic engineering, with significantly stronger coding capabilities than its predecessor. It achieves state-of-the-art performance on SWE-Bench Pro and leads GLM-5 by a wide margin.

tools thinking cloud

114.7K Pulls 1 Tag Updated 2 weeks ago

gemma4

Gemma 4 models are designed to deliver frontier-level performance at each size. They are well-suited for reasoning, agentic workflows, coding, and multimodal understanding.

vision tools thinking audio cloud e2b e4b 26b 31b

5.6M Pulls 29 Tags Updated 1 week ago

qwen3.5

Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

7.2M Pulls 58 Tags Updated 3 weeks ago

lfm2

LFM2 is a family of hybrid models designed for on-device deployment. LFM2-24B-A2B is the largest model in the family, scaling the architecture to 24 billion parameters while keeping inference efficient.

tools 24b

1.1M Pulls 6 Tags Updated 2 months ago

qwen3-coder-next

Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

tools cloud

1.1M Pulls 4 Tags Updated 2 months ago

glm-4.7-flash

As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

tools thinking

1.2M Pulls 4 Tags Updated 3 months ago

lfm2.5-thinking

LFM2.5 is a new family of hybrid models designed for on-device deployment.

tools 1.2b

1.1M Pulls 5 Tags Updated 3 months ago

ministral-3

The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

vision tools cloud 3b 8b 14b

1M Pulls 16 Tags Updated 4 months ago

devstral-small-2

A 24B model that excels at using tools to explore codebases, editing multiple files, and powering software engineering agents.

vision tools cloud 24b

793K Pulls 6 Tags Updated 4 months ago

nemotron-3-super

NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

tools thinking cloud 120b

245.2K Pulls 7 Tags Updated 1 month ago

glm-ocr

GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

vision tools

282.9K Pulls 3 Tags Updated 2 months ago

nemotron-cascade-2

An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

tools thinking 30b

105.5K Pulls 3 Tags Updated 1 month ago

qwen3-next

The first installment in the Qwen3-Next series, with strong parameter efficiency and inference speed.

tools thinking cloud 80b

528.4K Pulls 10 Tags Updated 4 months ago

kimi-k2.5

Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

vision tools thinking cloud

260.6K Pulls 1 Tag Updated 2 months ago

rnj-1

Rnj-1 is a family of 8B-parameter, open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.

tools cloud 8b

450.2K Pulls 6 Tags Updated 4 months ago

minimax-m2.7

MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

tools thinking cloud

97K Pulls 1 Tag Updated 1 month ago

nemotron-3-nano

Nemotron-3-Nano is a new standard for efficient, open, and intelligent agentic models, now updated with a 4B-parameter model.

tools thinking cloud 4b 30b

408.8K Pulls 9 Tags Updated 1 month ago