small · Ollama

smallthinker

A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.

244.9K Pulls 5 Tags Updated 1 year ago

devstral-small-2

24B model that excels at using tools to explore codebases, editing multiple files and power software engineering agents.

vision tools cloud 24b

832.3K Pulls 6 Tags Updated 5 months ago

mistral-small3.2

An update to Mistral Small that improves on function calling, instruction following, and less repetition errors.

vision tools 24b

2.1M Pulls 5 Tags Updated 11 months ago

magistral

Magistral is a small, efficient reasoning model with 24B parameters.

tools thinking 24b

1.4M Pulls 5 Tags Updated 11 months ago

llama3.2

Meta's Llama 3.2 goes small with 1B and 3B models.

tools 1b 3b

69.5M Pulls 63 Tags Updated 1 year ago

mistral-small

Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.

tools 22b 24b

3M Pulls 21 Tags Updated 1 year ago

smollm

🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.

135m 360m 1.7b

1.8M Pulls 94 Tags Updated 1 year ago

mistral-small3.1

Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.

vision tools 24b

741.8K Pulls 5 Tags Updated 1 year ago

moondream

moondream2 is a small vision language model designed to run efficiently on edge devices.

vision 1.8b

1.2M Pulls 18 Tags Updated 2 years ago

nemotron-mini

A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.

tools 4b

663.6K Pulls 17 Tags Updated 1 year ago

command-r7b

The smallest model in Cohere's R series delivers top-tier speed, efficiency, and quality to build powerful AI applications on commodity GPUs and edge devices.

tools 7b

259.4K Pulls 5 Tags Updated 1 year ago

llava-phi3

A new small LLaVA model fine-tuned from Phi 3 Mini.

vision 3.8b

282.2K Pulls 4 Tags Updated 2 years ago

stablelm-zephyr

A lightweight chat model allowing accurate, and responsive output without requiring high-end hardware.

3b

507.9K Pulls 17 Tags Updated 2 years ago

stable-beluga

Llama 2 based model fine tuned on an Orca-style dataset. Originally called Free Willy.

7b 13b 70b

878.7K Pulls 49 Tags Updated 2 years ago

qwen2-math

Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperforms the mathematical capabilities of open-source models and even closed-source models (e.g., GPT4o).

1.5b 7b 72b

1M Pulls 52 Tags Updated 1 year ago

stable-code

Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.

3b

1M Pulls 36 Tags Updated 2 years ago

sailor2

Sailor2 are multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.

1b 8b 20b

389.4K Pulls 13 Tags Updated 1 year ago

meditron

Open-source medical large language model adapted from Llama 2 to the medical domain.

7b 70b

694.8K Pulls 22 Tags Updated 2 years ago

functiongemma

FunctionGemma is a specialized version of Google's Gemma 3 270M model fine-tuned explicitly for function calling.

tools 270m

158.3K Pulls 4 Tags Updated 5 months ago

llama3.3

New state of the art 70B model. Llama 3.3 70B offers similar performance compared to the Llama 3.1 405B model.

tools 70b

3.9M Pulls 14 Tags Updated 1 year ago