Ollama
m3e · Ollama Search
Search for models on Ollama.
  • ministral-3

    The Ministral 3 family is designed for edge deployment, capable of running on a wide range of hardware.

    vision tools cloud 3b 8b 14b

    111.9K Pulls · 16 Tags · Updated 6 days ago

  • mistral-small3.1

    Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.

    vision tools 24b

    446.2K Pulls · 5 Tags · Updated 8 months ago

  • mistral-large-3

    A general-purpose multimodal mixture-of-experts model for production-grade tasks and enterprise workloads.

    cloud

    4,858 Pulls · 1 Tag · Updated 2 weeks ago

  • mistral-small

    Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.

    tools 22b 24b

    2.2M Pulls · 21 Tags · Updated 10 months ago

  • granite3.1-moe

    The IBM Granite 1B and 3B models are long-context mixture-of-experts (MoE) models designed for low-latency usage.

    tools 1b 3b

    1.6M Pulls · 33 Tags · Updated 11 months ago

  • orca-mini

    A general-purpose model ranging from 3 billion parameters to 70 billion, suitable for entry-level hardware.

    3b 7b 13b 70b

    1.4M Pulls · 119 Tags · Updated 2 years ago

  • mistral-small3.2

    An update to Mistral Small that improves function calling and instruction following and reduces repetition errors.

    vision tools 24b

    892.4K Pulls · 5 Tags · Updated 6 months ago

  • llama3-chatqa

    A model from NVIDIA based on Llama 3 that excels at conversational question answering (QA) and retrieval-augmented generation (RAG).

    8b 70b

    154.9K Pulls · 35 Tags · Updated 1 year ago

  • granite-embedding

    The IBM Granite Embedding 30M and 278M models are text-only dense biencoder embedding models, with 30M available in English only and 278M serving multilingual use cases.

    embedding 30m 278m

    144K Pulls · 6 Tags · Updated 1 year ago

  • granite3-moe

    The IBM Granite 1B and 3B models are IBM's first mixture-of-experts (MoE) Granite models, designed for low-latency usage.

    tools 1b 3b

    114.2K Pulls · 33 Tags · Updated 1 year ago

  • milkey/m3e

    Moka-AI Massive Mixed Embedding

    embedding

    6,964 Pulls · 7 Tags · Updated 1 year ago

  • twwch/m3e-base

    embedding

    1,432 Pulls · 1 Tag · Updated 1 year ago

  • yxl/m3e

    Embedding

    embedding

    1,161 Pulls · 3 Tags · Updated 1 year ago

  • turingdance/m3e-base

    embedding

    147 Pulls · 1 Tag · Updated 9 months ago

  • davisgao/m3e

    embedding

    embedding

    91 Pulls · 1 Tag · Updated 1 year ago

  • zailiang/m3e

    embedding

    49 Pulls · 1 Tag · Updated 1 year ago

  • lyyyt/m3e-forensic-finetuned

    embedding

    15 Pulls · 1 Tag · Updated 1 month ago

  • bge-m3

    BGE-M3 is a new model from BAAI distinguished for its versatility in Multi-Functionality, Multi-Linguality, and Multi-Granularity.

    embedding 567m

    2.9M Pulls · 3 Tags · Updated 1 year ago

  • monotykamary/medichat-llama3

    Built upon the powerful LLaMa-3 architecture and fine-tuned on an extensive dataset of health information, this model leverages its vast medical knowledge to offer clear, comprehensive answers.

    8b

    2,929 Pulls · 6 Tags · Updated 1 year ago

  • martain7r/finance-llama-8b

    Finance-Llama-8B is a fine-tuned Llama 3.1 8B model trained on 500k examples for tasks like QA, reasoning, sentiment, and NER. It supports multi-turn dialogue and is ideal for financial assistants.

    2,014 Pulls · 2 Tags · Updated 6 months ago

© 2025 Ollama