Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Ollama
Search for models on Ollama.
  • deepseek-ocr

    DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

    vision 3b

    442.6K  Pulls 3  Tags Updated  5 months ago

  • cogito-2.1

    The Cogito v2.1 LLMs are instruction tuned generative models. All models are released under MIT license for commercial use.

    cloud 671b

    184K  Pulls 6  Tags Updated  5 months ago

  • kimi-k2-thinking

    Kimi K2 Thinking, Moonshot AI's best open-source thinking model.

    tools thinking cloud

    905.8K  Pulls 1  Tag Updated  6 months ago

  • minimax-m2

    MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

    tools thinking cloud

    951K  Pulls 1  Tag Updated  6 months ago

  • gpt-oss-safeguard

    gpt-oss-safeguard-20b and gpt-oss-safeguard-120b are safety reasoning models built-upon gpt-oss

    tools thinking 20b 120b

    139.4K  Pulls 3  Tags Updated  6 months ago

  • glm-4.6

    Advanced agentic, reasoning and coding capabilities.

    tools thinking cloud

    955.5K  Pulls 1  Tag Updated  7 months ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools thinking cloud 2b 4b 8b 30b 32b 235b

    3.7M  Pulls 59  Tags Updated  6 months ago

  • granite4

    Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.

    tools 350m 1b 3b

    1.2M  Pulls 17  Tags Updated  6 months ago

  • kimi-k2

    A state-of-the-art mixture-of-experts (MoE) language model. Kimi K2-Instruct-0905 demonstrates significant improvements in performance on public benchmarks and real-world coding agent tasks.

    tools cloud

    66.6K  Pulls 1  Tag Updated  7 months ago

  • qwen3-embedding

    Building upon the foundational models of the Qwen3 series, Qwen3 Embedding provides a comprehensive range of text embeddings models in various sizes

    embedding 0.6b 4b 8b

    1.9M  Pulls 12  Tags Updated  7 months ago

  • embeddinggemma

    EmbeddingGemma is a 300M parameter embedding model from Google.

    embedding 300m

    1.2M  Pulls 5  Tags Updated  8 months ago

  • deepseek-v3.1

    DeepSeek-V3.1-Terminus is a hybrid model that supports both thinking mode and non-thinking mode.

    tools thinking cloud 671b

    676.8K  Pulls 8  Tags Updated  7 months ago

  • gpt-oss

    OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

    tools thinking cloud 20b 120b

    9.5M  Pulls 5  Tags Updated  7 months ago

  • qwen3-coder

    Alibaba's performant long context models for agentic and coding tasks.

    tools cloud 30b 480b

    5.4M  Pulls 10  Tags Updated  7 months ago

  • mistral-small3.2

    An update to Mistral Small that improves on function calling, instruction following, and less repetition errors.

    vision tools 24b

    2M  Pulls 5  Tags Updated  10 months ago

  • gemma3n

    Gemma 3n models are designed for efficient execution on everyday devices such as laptops, tablets or phones.

    e2b e4b

    1.6M  Pulls 9  Tags Updated  10 months ago

  • magistral

    Magistral is a small, efficient reasoning model with 24B parameters.

    tools thinking 24b

    1.4M  Pulls 5  Tags Updated  11 months ago

  • devstral

    Devstral: the best open source model for coding agents

    tools 24b

    939.3K  Pulls 5  Tags Updated  10 months ago

  • qwen2.5vl

    Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.

    vision 3b 7b 32b 72b

    1.9M  Pulls 17  Tags Updated  11 months ago

  • phi4-reasoning

    Phi 4 reasoning and reasoning plus are 14-billion parameter open-weight reasoning models that rival much larger models on complex reasoning tasks.

    14b

    1.6M  Pulls 9  Tags Updated  1 year ago

© 2026 Ollama
Blog Contact