Ollama
Models Docs Pricing
Sign in Download
Models Download Docs Pricing Sign in
⇅
Embed models · Ollama
Embed models on Ollama.
  • lfm2

    LFM2 is a family of hybrid models designed for on-device deployment. LFM2-24B-A2B is the largest model in the family, scaling the architecture to 24 billion parameters while keeping inference efficient.

    tools 24b

    1.6M  Pulls 6  Tags Updated  4 days ago

  • qwen3-coder-next

    Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

    tools cloud

    699.2K  Pulls 4  Tags Updated  3 weeks ago

  • qwen3.5

    Qwen 3.5 is a family of open-source multimodal models that delivers exceptional utility and performance.

    vision tools thinking cloud 27b 35b 122b

    89.3K  Pulls 10  Tags Updated  yesterday

  • glm-5

    A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.

    cloud

    61.8K  Pulls 1  Tag Updated  2 weeks ago

  • minimax-m2.5

    MiniMax-M2.5 is a state-of-the-art large language model designed for real-world productivity and coding tasks.

    cloud

    57K  Pulls 1  Tag Updated  2 weeks ago

  • glm-ocr

    GLM-OCR is a multimodal OCR model for complex document understanding, built on the GLM-V encoder–decoder architecture.

    vision tools

    49.2K  Pulls 3  Tags Updated  3 weeks ago

  • lfm2.5-thinking

    LFM2.5 is a new family of hybrid models designed for on-device deployment.

    tools 1.2b

    665.5K  Pulls 5  Tags Updated  1 month ago

  • translategemma

    A new collection of open translation models built on Gemma 3, helping people communicate across 55 languages.

    vision 4b 12b 27b

    412.7K  Pulls 13  Tags Updated  1 month ago

  • glm-4.7-flash

    As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.

    tools thinking

    316K  Pulls 4  Tags Updated  1 month ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools thinking cloud 2b 4b 8b 30b 32b 235b

    1.6M  Pulls 59  Tags Updated  4 months ago

  • ministral-3

    The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

    vision tools cloud 3b 8b 14b

    495.7K  Pulls 16  Tags Updated  2 months ago

  • qwen3-embedding

    Building upon the foundational models of the Qwen3 series, Qwen3 Embedding provides a comprehensive range of text embeddings models in various sizes

    embedding 0.6b 4b 8b

    999.6K  Pulls 12  Tags Updated  5 months ago

  • granite4

    Granite 4 features improved instruction following (IF) and tool-calling capabilities, making them more effective in enterprise applications.

    tools 350m 1b 3b

    740.3K  Pulls 17  Tags Updated  4 months ago

  • rnj-1

    Rnj-1 is a family of 8B parameter open-weight, dense models trained from scratch by Essential AI, optimized for code and STEM with capabilities on par with SOTA open-weight models.

    tools cloud 8b

    328.3K  Pulls 6  Tags Updated  2 months ago

  • qwen3-next

    The first installment in the Qwen3-Next series with strong performance in terms of both parameter efficiency and inference speed.

    tools thinking cloud 80b

    345.2K  Pulls 10  Tags Updated  2 months ago

  • kimi-k2.5

    Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.

    cloud

    96.8K  Pulls 1  Tag Updated  1 month ago

  • embeddinggemma

    EmbeddingGemma is a 300M parameter embedding model from Google.

    embedding 300m

    550.2K  Pulls 5  Tags Updated  5 months ago

  • nemotron-3-nano

    Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models

    tools thinking cloud 30b

    180.4K  Pulls 6  Tags Updated  2 months ago

  • deepseek-ocr

    DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

    vision 3b

    228.2K  Pulls 3  Tags Updated  3 months ago

  • olmo-3

    Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

    7b 32b

    166.2K  Pulls 15  Tags Updated  2 months ago

© 2026 Ollama
Blog Contact