Ollama
falcon · Ollama
Search for models on Ollama.
  • falcon3

    A family of efficient AI models under 10B parameters that perform well in science, math, and coding through innovative training techniques.

    1b 3b 7b 10b

    2.5M Pulls · 17 Tags · Updated 1 year ago

  • falcon

    A large language model built by the Technology Innovation Institute (TII) for use in summarization, text generation, and chatbots.

    7b 40b 180b

    1M Pulls · 38 Tags · Updated 2 years ago

  • falcon2

    Falcon2 is an 11B-parameter causal decoder-only model built by TII and trained on 5T tokens.

    11b

    495K Pulls · 17 Tags · Updated 1 year ago

  • granite3.2

    Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.

    tools 2b 8b

    422.6K Pulls · 9 Tags · Updated 1 year ago

  • mistrallite

    MistralLite is a fine-tuned model based on Mistral with enhanced capabilities for processing long contexts.

    7b

    483.9K Pulls · 17 Tags · Updated 2 years ago

  • qwen3-coder-next

    Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

    tools cloud

    1.1M Pulls · 4 Tags · Updated 2 months ago

  • ExpedientFalcon/Qwen3-4B-UD-Q5_K_XL

    Qwen3-4B Q5_K_XL, an Unsloth UD 2.0 adaptively quantized model. Much better for coding than vanilla Q4_K_M quants, without taking up the VRAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main

    tools

    447K Pulls · 1 Tag · Updated 11 months ago

  • ExpedientFalcon/qwen3-reranker

    1,428 Pulls · 5 Tags · Updated 8 months ago

  • sam860/falcon-h1

    1.5b

    248 Pulls · 5 Tags · Updated 6 months ago

  • emulayt/falcon-tiny-r-q4km

    11 Pulls · 1 Tag · Updated 2 months ago

  • Lalit08/falcon-mamba-altruist

    2 Pulls · 1 Tag · Updated 3 weeks ago

  • ExpedientFalcon/qwen3-1.7b-autocomplete

    tools thinking

    263 Pulls · 1 Tag · Updated 10 months ago

  • ExpedientFalcon/qwen3-embedding

    embedding

    216 Pulls · 4 Tags · Updated 8 months ago

  • ExpedientFalcon/qwen2.5-coder-3b-instruct-q6_k

    This repo contains the instruction-tuned 3B Qwen2.5-Coder model in GGUF format: https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct-GGUF/tree/main

    tools

    215 Pulls · 1 Tag · Updated 11 months ago

  • ExpedientFalcon/qwen3-14b-agent-m2max

    tools thinking

    61 Pulls · 1 Tag · Updated 10 months ago

  • ExpedientFalcon/qwen3-4b-agent

    Model with tweaked parameters, optimized for agent use

    tools

    59 Pulls · 1 Tag · Updated 10 months ago

  • ExpedientFalcon/qwen3-32b-agent

    Model with tweaked parameters, optimized for agent use

    tools thinking

    50 Pulls · 1 Tag · Updated 10 months ago

  • ExpedientFalcon/qwen3-14b-agent-syndra

    tools thinking

    25 Pulls · 1 Tag · Updated 9 months ago

  • huihui_ai/falcon3-abliterated

    A family of efficient AI models under 10B parameters that perform well in science, math, and coding through innovative training techniques.

    1b 3b 7b 10b

    1,629 Pulls · 21 Tags · Updated 1 year ago

  • GFalcon-UA/nous-hermes-2-vision

    llava-NousResearch_Nous-Hermes-2-Vision-GGUF_Q4_0 with function calling

    1,722 Pulls · 1 Tag · Updated 1 year ago

© 2026 Ollama