Ollama
llama4 · Ollama Search
Search for models on Ollama.
  • llama4

    Meta's latest collection of multimodal models.

    vision tools 16x17b 128x17b

    699.4K Pulls · 11 Tags · Updated 3 months ago

  • llama3.1

    Llama 3.1 is a new state-of-the-art model from Meta available in 8B, 70B and 405B parameter sizes.

    tools 8b 70b 405b

    103.1M Pulls · 93 Tags · Updated 9 months ago

  • llama3.3

    A new state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model.

    tools 70b

    2.5M Pulls · 14 Tags · Updated 9 months ago

  • firefunction-v2

    An open weights function calling model based on Llama 3, competitive with GPT-4o function calling capabilities.

    tools 70b

    33.6K Pulls · 17 Tags · Updated 1 year ago

  • aravhawk/llama4

    The Llama 4 models are Meta's flagship LLMs. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text understanding and generation.

    109b 400b

    5,994 Pulls · 5 Tags · Updated 5 months ago

  • ingu627/llama4-scout-q4

    The Llama 4 collection of models are natively multimodal AI models that enable text and multimodal experiences. These models leverage a mixture-of-experts architecture to offer industry-leading performance in text and image understanding.

    109b 400b

    3,250 Pulls · 3 Tags · Updated 4 months ago

  • johanteekens/Llama-4-Scout-17B-16E-Instruct

    Works, thanks ollama team for supporting Llama4!

    907 Pulls · 1 Tag · Updated 5 months ago

  • compcj/llama4-scout-ud-q2-k-xl

    unsloth's v2.0 dynamic quants of Llama-4-Scout-17B-16E-Instruct-GGUF, Q2_K_XL (2.71-bit).

    tools

    281 Pulls · 1 Tag · Updated 4 months ago

  • dokterbob/unsloth-llama4-scout

    tools

    56 Pulls · 1 Tag · Updated 3 months ago

  • hhao/openbmb-minicpm-llama3-v-2_5

    MiniCPM-V surpasses proprietary models such as GPT-4V, Gemini Pro, Qwen-VL and Claude 3 in overall performance, and supports multimodal conversation in over 30 languages.

    vision

    44.6K Pulls · 8 Tags · Updated 1 year ago

  • huihui_ai/llama3.3-abliterated

    A new state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model.

    tools 70b

    22K Pulls · 10 Tags · Updated 9 months ago

  • wangshenzhi/llama3-8b-chinese-chat-ollama-q4

    The ollama model for the 4bit-quantized GGUF version of llama3-8b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat-GGUF-4bit).

    7,442 Pulls · 2 Tags · Updated 1 year ago

  • JollyLlama/GLM-4-32B-0414-Q4_K_M

    This model requires Ollama v0.6.6 or later

    3,677 Pulls · 1 Tag · Updated 5 months ago

  • nsheth/llama-3-lumimaid-8b-v0.1-iq-imatrix

    Uses the Q4_K_M-imat (4.89 BPW) quant, for context sizes up to 12288 on less than 8 GB of VRAM.

    vision

    3,219 Pulls · 1 Tag · Updated 1 year ago

  • wangshenzhi/llama3-70b-chinese-chat-ollama-q4

    The ollama model for the 4bit-quantized GGUF version of llama3-70b-chinese-chat (https://huggingface.co/shenzhi-wang/Llama3-70B-Chinese-Chat-GGUF-4bit).

    2,752 Pulls · 1 Tag · Updated 1 year ago

  • HammerAI/hermes-3-llama-3.1

    NousResearch/Hermes-3-Llama-3.1-405B

    tools

    2,720 Pulls · 2 Tags · Updated 1 year ago

  • MHKetbi/nvidia_Llama-3.3-Nemotron-Super-49B-v1

    A reasoning model post-trained for reasoning, human chat preferences, and tasks such as RAG and tool calling.

    2,212 Pulls · 6 Tags · Updated 6 months ago

  • catsarethebest/llama3.2-4oClaude

    Llama 3.2 1B trained on data distilled from GPT-4o, Claude 3.5 and Claude Opus.

    tools

    939 Pulls · 2 Tags · Updated 2 months ago

  • blackened/llama-3-8b-gpt-4o-ru1.0

    https://habr.com/ru/articles/830332/

    791 Pulls · 1 Tag · Updated 1 year ago

  • trinsition/minicpmv

    minicpm-llama3-2.5-8b-16-v: with only 8B parameters, it surpasses widely used proprietary models such as GPT-4V-1106, Gemini Pro, Claude 3 and Qwen-VL-Max, and greatly outperforms other Llama 3-based MLLMs.

    481 Pulls · 1 Tag · Updated 1 year ago

© 2025 Ollama