Ollama

nemotron-cascade-2

An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

tools thinking 30b

45.3K Pulls 3 Tags Updated 1 week ago

minimax-m2.7

MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

tools thinking cloud

42.7K Pulls 1 Tag Updated 1 week ago

lfm2

LFM2 is a family of hybrid models designed for on-device deployment. LFM2-24B-A2B is the largest model in the family, scaling the architecture to 24 billion parameters while keeping inference efficient.

tools 24b

1M Pulls 6 Tags Updated 1 month ago

nemotron-3-super

NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

tools thinking cloud 120b

165.2K Pulls 7 Tags Updated 2 weeks ago

qwen3.5

Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

vision tools thinking cloud 0.8b 2b 4b 9b 27b 35b 122b

4.2M Pulls 58 Tags Updated yesterday

glm-5

A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

tools thinking cloud

149.3K Pulls 1 Tag Updated 1 month ago

minimax-m2.5

MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

tools thinking cloud

150.5K Pulls 1 Tag Updated 1 month ago

qwen3-coder-next

Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

tools cloud

981.1K Pulls 4 Tags Updated 1 month ago

glm-ocr

GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

vision tools

170.9K Pulls 3 Tags Updated 1 month ago

kimi-k2.5

Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

vision tools thinking cloud

205.9K Pulls 1 Tag Updated 2 months ago

lfm2.5-thinking

LFM2.5 is a new family of hybrid models designed for on-device deployment.

tools 1.2b

1.1M Pulls 5 Tags Updated 2 months ago

glm-4.7-flash

As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

tools thinking

994K Pulls 4 Tags Updated 2 months ago

translategemma

A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.

vision 4b 12b 27b

884.3K Pulls 13 Tags Updated 2 months ago

glm-4.7

Advancing the Coding Capability

tools thinking cloud

79.9K Pulls 1 Tag Updated 3 months ago

minimax-m2.1

Exceptional multilingual capabilities to elevate code engineering

tools cloud

33.7K Pulls 1 Tag Updated 3 months ago

gemini-3-flash-preview

Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

vision tools thinking cloud

112.7K Pulls 2 Tags Updated 3 months ago

nemotron-3-nano

Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

tools thinking cloud 4b 30b

324.8K Pulls 9 Tags Updated 2 weeks ago

functiongemma

FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

tools 270m

123.9K Pulls 4 Tags Updated 3 months ago

olmo-3.1

Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

tools 32b

208.6K Pulls 10 Tags Updated 3 months ago

olmo-3

Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

7b 32b

330K Pulls 15 Tags Updated 3 months ago