SmolLM2 is a family of compact language models available in three sizes: 135M, 360M, and 1.7B parameters.
2.2M Pulls 49 Tags Updated 1 year ago
🪐 A family of small models with 135M, 360M, and 1.7B parameters, trained on a new high-quality dataset.
574.6K Pulls 94 Tags Updated 1 year ago
StarCoder is a code generation model trained on 80+ programming languages.
252.8K Pulls 100 Tags Updated 2 years ago
Stable Code 3B is a coding model with instruct and code completion variants on par with models such as Code Llama 7B that are 2.5x larger.
181.2K Pulls 36 Tags Updated 1 year ago
Starling is a large language model trained by reinforcement learning from AI feedback focused on improving chatbot helpfulness.
126.4K Pulls 36 Tags Updated 1 year ago
Solar Pro Preview: an advanced large language model (LLM) with 22 billion parameters designed to fit into a single GPU.
72.3K Pulls 18 Tags Updated 1 year ago
DeepSeek-R1 is a family of open reasoning models with performance approaching that of leading models, such as O3 and Gemini 2.5 Pro.
74.5M Pulls 35 Tags Updated 5 months ago
DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.
2.3M Pulls 102 Tags Updated 1 year ago
DeepCoder is a fully open-source 14B coder model at O3-mini level, with a 1.5B version also available.
456.3K Pulls 9 Tags Updated 8 months ago
Building upon Mistral Small 3, Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance.
446.1K Pulls 5 Tags Updated 8 months ago
Sentence-transformers model that can be used for tasks like clustering or semantic search.
175.2K Pulls 3 Tags Updated 1 year ago
Olmo is a series of Open language models designed to enable the science of language models. These models are pre-trained on the Dolma 3 dataset and post-trained on the Dolci datasets.
5,840 Pulls 15 Tags Updated 3 days ago
Dolphin 3.0 Llama 3.1 8B 🐬 is the next generation of the Dolphin series of instruct-tuned models designed to be the ultimate general purpose local model, enabling coding, math, agentic, function calling, and general use cases.
3.5M Pulls 5 Tags Updated 11 months ago
A state-of-the-art 12B model with 128k context length, built by Mistral AI in collaboration with NVIDIA.
3.1M Pulls 17 Tags Updated 4 months ago
A suite of text embedding models by Snowflake, optimized for performance.
1.3M Pulls 16 Tags Updated 1 year ago
An update to Mistral Small that improves function calling and instruction following and reduces repetition errors.
892.2K Pulls 5 Tags Updated 6 months ago
Snowflake's frontier embedding model. Arctic Embed 2.0 adds multilingual support without sacrificing English performance or scalability.
209.4K Pulls 3 Tags Updated 1 year ago
Aya 23, released by Cohere, is a new family of state-of-the-art, multilingual models that support 23 languages.
191.5K Pulls 33 Tags Updated 1 year ago
SQLCoder is a code completion model fine-tuned on StarCoder for SQL generation tasks.
166.5K Pulls 48 Tags Updated 1 year ago
Stable LM 2 is a state-of-the-art language model, available in 1.6B and 12B parameter sizes, trained on multilingual data in English, Spanish, German, Italian, French, Portuguese, and Dutch.
157.9K Pulls 84 Tags Updated 1 year ago