A family of efficient AI models under 10B parameters performant in science, math, and coding through innovative training techniques.
2.5M Pulls 17 Tags Updated 1 year ago
A large language model built by the Technology Innovation Institute (TII) for use in summarization, text generation, and chat bots.
1M Pulls 38 Tags Updated 2 years ago
Falcon2 is an 11B parameters causal decoder-only model built by TII and trained over 5T tokens.
495K Pulls 17 Tags Updated 1 year ago
Granite-3.2 is a family of long-context AI models from IBM Granite fine-tuned for thinking capabilities.
422.6K Pulls 9 Tags Updated 1 year ago
MistralLite is a fine-tuned model based on Mistral with enhanced capabilities of processing long contexts.
483.9K Pulls 17 Tags Updated 2 years ago
Qwen3-Coder-Next is a coding-focused language model from Alibaba's Qwen team, optimized for agentic coding workflows and local development.
1.1M Pulls 4 Tags Updated 2 months ago
Qwen3-4B Q5_K_XL Unsloth UD 2.0 adaptively quantized model, much better for coding than vanilla Q4_K_M quants without taking up the VWAM of an 8-bit Q8_0 model. From https://huggingface.co/unsloth/Qwen3-4B-GGUF/tree/main
447K Pulls 1 Tag Updated 11 months ago
1,428 Pulls 5 Tags Updated 8 months ago
248 Pulls 5 Tags Updated 6 months ago
11 Pulls 1 Tag Updated 2 months ago
2 Pulls 1 Tag Updated 3 weeks ago
263 Pulls 1 Tag Updated 10 months ago
216 Pulls 4 Tags Updated 8 months ago
This repo contains the instruction-tuned 3B Qwen2.5-Coder model in the GGUF Format: https://huggingface.co/Qwen/Qwen2.5-Coder-3B-Instruct-GGUF/tree/main
215 Pulls 1 Tag Updated 11 months ago
61 Pulls 1 Tag Updated 10 months ago
Model with tweaked params optimized for agent use
59 Pulls 1 Tag Updated 10 months ago
50 Pulls 1 Tag Updated 10 months ago
25 Pulls 1 Tag Updated 9 months ago
1,629 Pulls 21 Tags Updated 1 year ago
llava-NousResearch_Nous-Hermes-2-Vision-GGUF_Q4_0 with function calling
1,722 Pulls 1 Tag Updated 1 year ago