SmolLM2 is a family of compact language models available in three sizes: 135M, 360M, and 1.7B parameters.
3.4M Pulls 49 Tags Updated 1 year ago
🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.
1.8M Pulls 94 Tags Updated 1 year ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
1M Pulls 36 Tags Updated 2 years ago
StarCoder is a code generation model trained on 80+ programming languages.
1.1M Pulls 100 Tags Updated 2 years ago
Starling is a large language model trained by reinforcement learning from AI feedback focused on improving chatbot helpfulness.
920.5K Pulls 36 Tags Updated 2 years ago
Solar Pro Preview is an advanced large language model (LLM) with 22 billion parameters, designed to fit on a single GPU.
532.6K Pulls 18 Tags Updated 1 year ago
Laguna XS.2 is a 33B total parameter Mixture-of-Experts model with 3B activated parameters per token designed for agentic coding and long-horizon work on a local machine.
11.8K Pulls 7 Tags Updated 3 weeks ago
DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
85.8M Pulls 35 Tags Updated 10 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
4.2M Pulls 102 Tags Updated 2 years ago
DeepCoder is a fully open-source 14B coder model at O3-mini level, with a 1.5B version also available.
869.1K Pulls 9 Tags Updated 1 year ago
Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.
742.6K Pulls 5 Tags Updated 1 year ago
Sentence-transformers model that can be used for tasks like clustering or semantic search.
844.8K Pulls 3 Tags Updated 1 year ago
Kimi K2.6 is an open-source, native multimodal agentic model that advances practical capabilities in long-horizon coding, coding-driven design, proactive autonomous execution, and swarm-based task orchestration.
253K Pulls 1 Tag Updated 4 weeks ago
A strong reasoning and agentic model from Z.ai with 744B total parameters (40B active), built for complex systems engineering and long-horizon tasks.
1.9M Pulls 1 Tag Updated 3 months ago
As the strongest model in the 30B class, GLM-4.7-Flash offers a new option for lightweight deployment that balances performance and efficiency.
1.2M Pulls 4 Tags Updated 3 months ago
Kimi K2.5 is an open-source, native multimodal agentic model that seamlessly integrates vision and language understanding with advanced agentic capabilities, instant and thinking modes, as well as conversational and agentic paradigms.
288.5K Pulls 1 Tag Updated 3 months ago
Olmo is a series of open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.
429.1K Pulls 15 Tags Updated 5 months ago
273.3K Pulls 10 Tags Updated 5 months ago
An update to Mistral Small that improves function calling and instruction following and reduces repetition errors.
2.1M Pulls 5 Tags Updated 11 months ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.8M Pulls 5 Tags Updated 1 year ago