Ollama

nemotron-3-super

NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

tools thinking cloud 120b

256.5K Pulls 7 Tags Updated 1 month ago

glm-ocr

GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

vision tools

291.8K Pulls 3 Tags Updated 2 months ago

qwen3-next

The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

tools thinking cloud 80b

531.5K Pulls 10 Tags Updated 4 months ago

nemotron-cascade-2

An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

tools thinking 30b

108K Pulls 3 Tags Updated 1 month ago

kimi-k2.5

Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

vision tools thinking cloud

265.6K Pulls 1 Tag Updated 3 months ago

rnj-1

Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.

tools cloud 8b

452.2K Pulls 6 Tags Updated 4 months ago

nemotron-3-nano

Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

tools thinking cloud 4b 30b

415.3K Pulls 9 Tags Updated 1 month ago

minimax-m2.7

MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

tools thinking cloud

103K Pulls 1 Tag Updated 1 month ago

olmo-3

Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

7b 32b

417.8K Pulls 15 Tags Updated 4 months ago

glm-5

A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

tools thinking cloud

198.1K Pulls 1 Tag Updated 2 months ago

deepseek-ocr

DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

vision 3b

432.4K Pulls 3 Tags Updated 5 months ago

minimax-m2.5

MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

tools thinking cloud

168.9K Pulls 1 Tag Updated 2 months ago

olmo-3.1

Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

tools 32b

266.4K Pulls 10 Tags Updated 4 months ago

devstral-2

123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

tools cloud 123b

206.7K Pulls 6 Tags Updated 4 months ago

nomic-embed-text-v2-moe

nomic-embed-text-v2-moe is a multilingual MoE text embedding model that excels at multilingual retrieval.

embedding

194.1K Pulls 1 Tag Updated 4 months ago

functiongemma

FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

tools 270m

152.6K Pulls 4 Tags Updated 4 months ago

gemini-3-flash-preview

Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

vision tools thinking cloud

145K Pulls 2 Tags Updated 4 months ago

cogito-2.1

The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

cloud 671b

177.1K Pulls 6 Tags Updated 5 months ago

glm-4.7

Advancing the Coding Capability

tools thinking cloud

96K Pulls 1 Tag Updated 4 months ago

deepseek-v3.2

DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

tools thinking cloud

89.2K Pulls 1 Tag Updated 4 months ago