I upload models and quants that I like, or would like people to have easy downloads for.
-
LFM2
Small models made to run on mobile devices, developed by Liquid AI
350m 700m 1.2b 2.6b 8b8,033 Pulls 19 Tags Updated 6 months ago
-
dolphin3-llama3.2
Dolphin 3.0 llama3.2 🐬 - A powerful, customizable AI model for local use.
1b 3b3,111 Pulls 9 Tags Updated 1 year ago
-
qwen3
Useful Unsloth-DQ2 quants of the smaller qwen3 models
tools thinking 1.7b 4b 8b3,043 Pulls 21 Tags Updated 8 months ago
-
lfm2.5
An upgraded version of LFM2 trained on over twice as many tokens
350m 1.2b2,862 Pulls 4 Tags Updated 2 months ago
-
dolphin3-qwen2.5
Dolphin 3.0 Qwen 2.5 🐬 - A powerful, customizable AI model for local use.
tools 1.5b 3b2,647 Pulls 9 Tags Updated 1 year ago
-
qwen3-rerankertools
2,336 Pulls 2 Tags Updated 1 year ago
-
deepseek-r1-0528-qwen3
Useful DQ2 quants for deepseek-r1-0528-qwen3-distill:8b
8b2,003 Pulls 3 Tags Updated 1 year ago
-
granite-4.0
Efficient, intelligent, and tiny models from IBM
tools 350m 1b 3.2b 3.4b 7b1,271 Pulls 17 Tags Updated 7 months ago
-
qwen3-embeddingtools
873 Pulls 2 Tags Updated 1 year ago
-
lucy
A compact model focused on agentic web search
tools 1.7b801 Pulls 6 Tags Updated 10 months ago
-
granite-embedding-multilingual
Embedding models supporting multiple languages, made by the IBM-Granite team, in 2 (small) sizes
embedding794 Pulls 4 Tags Updated 1 year ago
-
exaone-4.0
LGAI has developed their latest version of the exaone model series, now with reasoning!
1.2b736 Pulls 4 Tags Updated 9 months ago
-
VibeThinkertools
669 Pulls 2 Tags Updated 7 months ago
-
phi4-mini
Phi-4-mini is the latest small LLM from microsoft. These are the quants that I like to use, that weren't uploaded to the main model by the ollama team.
tools 3.8b589 Pulls 3 Tags Updated 1 year ago
-
olmoe-1b-7b-0924
An MoE model developed by allenai competitive with llama2
584 Pulls 5 Tags Updated 1 year ago
-
olmo2
Allen Institute's Latest tiny LLM in the Olmo family
1b575 Pulls 6 Tags Updated 10 months ago
-
jamba-reasoning
A small and efficient reasoning model, with a hybrid transformer and mamba architecture
tools 3b574 Pulls 4 Tags Updated 8 months ago
-
amoral-gemma3-1b-v2
gemma3 finetuned for neutral, non-judgemental responses
489 Pulls 6 Tags Updated 11 months ago
-
falcon-h11.5b
380 Pulls 5 Tags Updated 8 months ago
-
gemma3
Google's new small model
270m 1b239 Pulls 10 Tags Updated 10 months ago
-
deepseek-r1-qwen-distill
Useful DQ2 quants for deepseek-r1-qwen-distill:1.5b
1.5b227 Pulls 6 Tags Updated 1 year ago
-
gemma3n
Google's MatFormer addition to the gemma3 family
e2b216 Pulls 6 Tags Updated 9 months ago
-
granite-embedding-english
Embedding models for English language, made by the IBM-Granite team, in 2 (small) sizes
embedding111 Pulls 4 Tags Updated 1 year ago
-
granite3.3
Additional quants of the granite3.3:2b model
tools 2b13 Pulls 5 Tags Updated 10 months ago
-
tulu3
A model finetuned from llama3.1, developed by allenai
5 Pulls 6 Tags Updated 1 year ago