Ollama
Models GitHub Discord Docs Pricing
Sign in Download
Models Download GitHub Discord Docs Pricing Sign in
⇅
lam · Ollama
Search for models on Ollama.
  • llama3-gradient

    This model extends LLama-3 8B's context length from 8k to over 1m tokens.

    8b 70b

    307.7K  Pulls 35  Tags Updated  1 year ago

  • llama3.1

    Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.

    tools 8b 70b 405b

    109.6M  Pulls 93  Tags Updated  1 year ago

  • llama3.3

    New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.

    tools 70b

    3.2M  Pulls 14  Tags Updated  1 year ago

  • lfm2.5-thinking

    LFM2.5 is a new family of hybrid models designed for on-device deployment.

    1.2b

    27.7K  Pulls 5  Tags Updated  1 week ago

  • qwen3-vl

    The most powerful vision-language model in the Qwen model family to date.

    vision tools thinking cloud 2b 4b 8b 30b 32b 235b

    1.3M  Pulls 59  Tags Updated  3 months ago

  • olmo-3

    Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

    7b 32b

    114.5K  Pulls 15  Tags Updated  1 month ago

  • deepseek-ocr

    DeepSeek-OCR is a vision-language model that can perform token-efficient OCR.

    vision 3b

    143.1K  Pulls 3  Tags Updated  2 months ago

  • olmo-3.1

    Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.

    tools 32b

    68.3K  Pulls 10  Tags Updated  1 month ago

  • minimax-m2

    MiniMax M2 is a high-efficiency large language model built for coding and agentic workflows.

    cloud

    47.6K  Pulls 1  Tag Updated  3 months ago

  • kimi-k2

    A state-of-the-art mixture-of-experts (MoE) language model. Kimi K2-Instruct-0905 demonstrates significant improvements in performance on public benchmarks and real-world coding agent tasks.

    cloud

    34.6K  Pulls 1  Tag Updated  4 months ago

  • mistral-large-3

    A general-purpose multimodal mixture-of-experts model for production-grade tasks and enterprise workloads.

    cloud

    14.4K  Pulls 1  Tag Updated  2 months ago

  • qwen3

    Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

    tools thinking 0.6b 1.7b 4b 8b 14b 30b 32b 235b

    18.4M  Pulls 58  Tags Updated  3 months ago

  • qwen2.5vl

    Flagship vision-language model of Qwen and also a significant leap from the previous Qwen2-VL.

    vision 3b 7b 32b 72b

    1.2M  Pulls 17  Tags Updated  8 months ago

  • phi4-reasoning

    Phi 4 reasoning and reasoning plus are 14-billion parameter open-weight reasoning models that rival much larger models on complex reasoning tasks.

    14b

    1.1M  Pulls 9  Tags Updated  9 months ago

  • llama4

    Meta's latest collection of multimodal models.

    vision tools 16x17b 128x17b

    1.1M  Pulls 11  Tags Updated  7 months ago

  • granite3.3

    IBM Granite 2B and 8B models are 128K context length language models that have been fine-tuned for improved reasoning and instruction-following capabilities.

    tools 2b 8b

    862.1K  Pulls 3  Tags Updated  9 months ago

  • granite3.2-vision

    A compact and efficient vision-language model, specifically designed for visual document understanding, enabling automated content extraction from tables, charts, infographics, plots, diagrams, and more.

    vision tools 2b

    702.9K  Pulls 5  Tags Updated  11 months ago

  • llama3.2

    Meta's Llama 3.2 goes small with 1B and 3B models.

    tools 1b 3b

    55.3M  Pulls 63  Tags Updated  1 year ago

  • qwen2.5

    Qwen2.5 models are pretrained on Alibaba's latest large-scale dataset, encompassing up to 18 trillion tokens. The model supports up to 128K tokens and has multilingual support.

    tools 0.5b 1.5b 3b 7b 14b 32b 72b

    20M  Pulls 133  Tags Updated  1 year ago

  • qwen2.5-coder

    The latest series of Code-Specific Qwen models, with significant improvements in code generation, code reasoning, and code fixing.

    tools 0.5b 1.5b 3b 7b 14b 32b

    10.7M  Pulls 199  Tags Updated  8 months ago

© 2026 Ollama
Blog Contact