falcon

falcon3

A family of efficient AI models under 10B parameters performant in science, math, and coding through innovative training techniques.

1b 3b 7b 10b

2.6M Pulls 17 Tags Updated 1 year ago

A large language model built by the Technology Innovation Institute (TII) for use in summarization, text generation, and chat bots.

7b 40b 180b

1.1M Pulls 38 Tags Updated 2 years ago

falcon2

Falcon2 is an 11B parameters causal decoder-only model built by TII and trained over 5T tokens.

11b

528K Pulls 17 Tags Updated 2 years ago

granite3.2

Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.

tools 2b 8b

447.2K Pulls 9 Tags Updated 1 year ago

lfm2.5

LFM2.5-8B-A1B, an edge model built for fast, reliable tool calling on consumer hardware.

tools thinking 8b

96.7K Pulls 5 Tags Updated 2 months ago

mistrallite

MistralLite is a fine-tuned model based on Mistral with enhanced capabilities of processing long contexts.

7b

512K Pulls 17 Tags Updated 2 years ago

qwen3-coder-next

Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.

tools

1.9M Pulls 3 Tags Updated 5 months ago

emulayt/falcon-tiny-r-q4km

18 Pulls 1 Tag Updated 5 months ago

sam860/falcon-h1

1.5b

417 Pulls 5 Tags Updated 9 months ago

ExpedientFalcon/Qwen3-4B-UD-Q5_K_XL

Qwen3-4B Q5_K_XL Unsloth UD 2.0 adaptively quantized model, much better for coding than vanilla Q4_K_M quants without taking up the VWAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main

tools

447.1K Pulls 1 Tag Updated 1 year ago

Lalit08/falcon-mamba-altruist

5 Pulls 1 Tag Updated 4 months ago

ExpedientFalcon/qwen3-reranker

1,627 Pulls 5 Tags Updated 12 months ago

huihui_ai/falcon3-abliterated

A family of efficient AI models under 10B parameters performant in science, math, and coding through innovative training techniques.

1b 3b 7b 10b

1,902 Pulls 21 Tags Updated 1 year ago

GFalcon-UA/dolphin3-r1-mistral

https://huggingface.co/cognitivecomputations/Dolphin3.0-R1-Mistral-24B

1,374 Pulls 1 Tag Updated 1 year ago

GFalcon-UA/nous-hermes-2-vision

llava-NousResearch_Nous-Hermes-2-Vision-GGUF_Q4_0 with function calling

1,759 Pulls 1 Tag Updated 2 years ago

GFalcon-UA/dolphin3-llama3.1

https://huggingface.co/cognitivecomputations/Dolphin3.0-Llama3.1-8B-GGUF

555 Pulls 1 Tag Updated 1 year ago

GFalcon-UA/dolphin3-mistral

https://huggingface.co/cognitivecomputations/Dolphin3.0-Mistral-24B

540 Pulls 1 Tag Updated 1 year ago

Hudson/falcon-mamba-instruct

The latest and greatest model of the Falcon LLM series.

583 Pulls 1 Tag Updated 1 year ago

ExpedientFalcon/qwen3-1.7b-autocomplete

tools thinking

306 Pulls 1 Tag Updated 1 year ago

ExpedientFalcon/qwen2.5-coder-3b-instruct-q6_k

This repo contains the instruction-tuned 3B Qwen2.5-Coder model in the GGUF Format: https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct-GGUF/tree/main

tools

271 Pulls 1 Tag Updated 1 year ago