A new small reasoning model fine-tuned from the Qwen 2.5 3B Instruct model.
85.5K Pulls 5 Tags Updated 10 months ago
Meta's Llama 3.2 goes small with 1B and 3B models.
42.9M Pulls 63 Tags Updated 1 year ago
Mistral Small 3 sets a new benchmark in the “small” Large Language Models category below 70B.
2.1M Pulls 21 Tags Updated 9 months ago
An update to Mistral Small that improves function calling and instruction following and reduces repetition errors.
721.8K Pulls 5 Tags Updated 4 months ago
Magistral is a small, efficient reasoning model with 24B parameters.
594.7K Pulls 5 Tags Updated 4 months ago
🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.
488.2K Pulls 94 Tags Updated 1 year ago
moondream2 is a small vision language model designed to run efficiently on edge devices.
374.3K Pulls 18 Tags Updated 1 year ago
Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.
348K Pulls 5 Tags Updated 6 months ago
A new small LLaVA model fine-tuned from Phi 3 Mini.
128.9K Pulls 4 Tags Updated 1 year ago
A commercial-friendly small language model by NVIDIA optimized for roleplay, RAG QA, and function calling.
109.9K Pulls 17 Tags Updated 1 year ago
The smallest model in Cohere's R series delivers top-tier speed, efficiency, and quality to build powerful AI applications on commodity GPUs and edge devices.
69.5K Pulls 5 Tags Updated 9 months ago
A lightweight chat model that delivers accurate, responsive output without requiring high-end hardware.
53.9K Pulls 17 Tags Updated 1 year ago
A Llama 2-based model fine-tuned on an Orca-style dataset. Originally called Free Willy.
74.9K Pulls 49 Tags Updated 2 years ago
Qwen2 Math is a series of specialized math language models built upon the Qwen2 LLMs, which significantly outperform open-source models and even closed-source models (e.g., GPT-4o) in mathematical capability.
175.6K Pulls 52 Tags Updated 1 year ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
158.1K Pulls 36 Tags Updated 1 year ago
Open-source medical large language model adapted from Llama 2 to the medical domain.
94.2K Pulls 22 Tags Updated 1 year ago
Sailor2 is a family of multilingual language models made for South-East Asia. Available in 1B, 8B, and 20B parameter sizes.
36.4K Pulls 13 Tags Updated 11 months ago
A new state-of-the-art 70B model. Llama 3.3 70B offers performance similar to the Llama 3.1 405B model.
2.7M Pulls 14 Tags Updated 10 months ago
Phi-4-mini brings significant enhancements in multilingual support, reasoning, and mathematics, and now supports the long-awaited function calling feature.
466K Pulls 5 Tags Updated 8 months ago
A strong multilingual general-purpose language model with performance competitive with Llama 3.
162.3K Pulls 32 Tags Updated 1 year ago