Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Ollama
Search for models on Ollama.
  • nemotron-3-super

    NVIDIA Nemotron 3 Super is a 120B open MoE model activating just 12B parameters to deliver maximum compute efficiency and accuracy for complex multi-agent applications.

    tools thinking cloud 120b

    256.5K  Pulls 7  Tags Updated  1 month ago

  • glm-ocr

    GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

    vision tools

    291.8K  Pulls 3  Tags Updated  2 months ago

  • qwen3-next

    The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

    tools thinking cloud 80b

    531.5K  Pulls 10  Tags Updated  4 months ago

  • nemotron-cascade-2

    An open 30B MoE model from NVIDIA with 3B activated parameters that delivers strong reasoning and agentic capabilities.

    tools thinking 30b

    108K  Pulls 3  Tags Updated  1 month ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

    vision tools thinking cloud

    265.6K  Pulls 1  Tag Updated  3 months ago

  • rnj-1

    Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.

    tools cloud 8b

    452.2K  Pulls 6  Tags Updated  4 months ago

  • nemotron-3-nano

    Nemotron-3-Nano is a new Standard for Efficient, Open, and Intelligent Agentic Models, now updated with a 4B parameter count model.

    tools thinking cloud 4b 30b

    415.3K  Pulls 9  Tags Updated  1 month ago

  • minimax-m2.7

    MiniMax's M2-series model for coding, agentic workflows, and professional productivity.

    tools thinking cloud

    103K  Pulls 1  Tag Updated  1 month ago

  • olmo-3

    Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

    7b 32b

    417.8K  Pulls 15  Tags Updated  4 months ago

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    tools thinking cloud

    198.1K  Pulls 1  Tag Updated  2 months ago

  • deepseek-ocr

    DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

    vision 3b

    432.4K  Pulls 3  Tags Updated  5 months ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    tools thinking cloud

    168.9K  Pulls 1  Tag Updated  2 months ago

  • olmo-3.1

    Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

    tools 32b

    266.4K  Pulls 10  Tags Updated  4 months ago

  • devstral-2

    123B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

    tools cloud 123b

    206.7K  Pulls 6  Tags Updated  4 months ago

  • nomic-embed-text-v2-moe

    nomic-embed-text-v2-moe is a multilingual MoE text embedding model that excels at multilingual retrieval.

    embedding

    194.1K  Pulls 1  Tag Updated  4 months ago

  • functiongemma

    FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

    tools 270m

    152.6K  Pulls 4  Tags Updated  4 months ago

  • gemini-3-flash-preview

    Gemini 3 Flash offers frontier intelligence built for speed at a fraction of the cost.

    vision tools thinking cloud

    145K  Pulls 2  Tags Updated  4 months ago

  • cogito-2.1

    The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

    cloud 671b

    177.1K  Pulls 6  Tags Updated  5 months ago

  • glm-4.7

    Advancing the Coding Capability

    tools thinking cloud

    96K  Pulls 1  Tag Updated  4 months ago

  • deepseek-v3.2

    DeepSeek-V3.2, a model that harmonizes high computational efficiency with superior reasoning and agent performance.

    tools thinking cloud

    89.2K  Pulls 1  Tag Updated  4 months ago

© 2026 Ollama
Blog Contact